SOCIAL CAPITAL: Its Origins and
Applications in Modern Sociology 


Alejandro Portes 
Department of Sociology, Princeton University, Princeton, New Jersey 08540 
ABSTRACT


This paper reviews the origins and definitions of social capital in the writings
of Bourdieu, Loury, and Coleman, among other authors. It distinguishes four 
sources of social capital and examines their dynamics. Applications of the 
concept in the sociological literature emphasize its role in social control, in 
family support, and in benefits mediated by extrafamilial networks. I provide 
examples of each of these positive functions. Negative consequences of the 
same processes also deserve attention for a balanced picture of the forces at 
play. I review four such consequences and illustrate them with relevant
examples. Recent writings on social capital have extended the concept from an
individual asset to a feature of communities and even nations. The final
sections describe this conceptual stretch and examine its limitations. I argue
that, as shorthand for the positive consequences of sociability, social capital 
has a definite place in sociological theory. However, excessive extensions of 
the concept may jeopardize its heuristic value.
Portes is the author of some 200 articles and chapters on national
development, international migration, Latin American and Caribbean
urbanization, and economic sociology. His most recent books include City on the
Edge, the Transformation of Miami (winner of the Robert Park award for 
best book in urban sociology and of the Anthony Leeds award for best book 
in urban anthropology in 1995); The New Second Generation (Russell Sage 
Foundation 1996); Caribbean Cities (Johns Hopkins University Press); and 
Immigrant America, a Portrait. The latter book was designated as a centen- 
nial publication by the University of California Press. It was originally pub- 
lished in 1990; the second edition, updated and containing new chapters on 
American immigration policy and the new second generation, was published 
in 1996. 


Introduction 


During recent years, the concept of social capital has become one of the most 
popular exports from sociological theory into everyday language. Dissemi- 
nated by a number of policy-oriented journals and general circulation maga- 
zines, social capital has evolved into something of a cure-all for the maladies 
affecting society at home and abroad. As with other sociological concepts that
have traveled a similar path, the original meaning of the term and its heuristic
value are being put to severe tests by these increasingly diverse applications.
As in the case of those earlier concepts, the point is approaching at which so- 
cial capital comes to be applied to so many events and in so many different 
contexts as to lose any distinct meaning. 

Despite its current popularity, the term does not embody any idea really new 
to sociologists. That involvement and participation in groups can have positive 
consequences for the individual and the community is a staple notion, dating 
back to Durkheim’s emphasis on group life as an antidote to anomie and self- 
destruction and to Marx’s distinction between an atomized class-in-itself and a 
mobilized and effective class-for-itself. In this sense, the term social capital 
simply recaptures an insight present since the very beginnings of the disci- 
pline. Tracing the intellectual background of the concept into classical times 
would be tantamount to revisiting sociology’s major nineteenth century sources. 
That exercise would not reveal, however, why this idea has caught on in recent 
years or why an unusual baggage of policy implications has been heaped on it. 

The novelty and heuristic power of social capital come from two sources. 
First, the concept focuses attention on the positive consequences of sociability 
while putting aside its less attractive features. Second, it places those positive 
consequences in the framework of a broader discussion of capital and calls
attention to how such nonmonetary forms can be important sources of power and
influence, like the size of one's stock holdings or bank account. The potential
fungibility of diverse sources of capital reduces the distance between the
sociological and economic perspectives and simultaneously engages the attention of
policy-makers seeking less costly, non-economic solutions to social problems. 

In the course of this review, I limit discussion to the contemporary reemer- 
gence of the idea to avoid a lengthy excursus into its classical predecessors. To 
an audience of sociologists, these sources and the parallels between present so- 
cial capital discussions and passages in the classical literature will be obvious. 
I examine, first, the principal authors associated with the contemporary usage 
of the term and their different approaches to it. Then I review the various 
mechanisms leading to the emergence of social capital and its principal appli- 
cations in the research literature. Next, I examine those not-so-desirable con- 
sequences of sociability that are commonly obscured in the contemporary lit- 
erature on the topic. This discussion aims at providing some balance to the fre- 
quently celebratory tone with which the concept is surrounded. That tone is es- 
pecially noticeable in those studies that have stretched the concept from a 
property of individuals and families to a feature of communities, cities, and 
even nations. The attention garnered by applications of social capital at this 
broader level also requires some discussion, particularly in light of the poten- 
tial pitfalls of that conceptual stretch. 

Definitions 

The first systematic contemporary analysis of social capital was produced by 
Pierre Bourdieu, who defined the concept as “the aggregate of the actual or
potential resources which are linked to possession of a durable network of more
or less institutionalized relationships of mutual acquaintance or recognition”
(Bourdieu 1985, p. 248; 1980). This initial treatment of the concept appeared
in some brief “Provisional Notes” published in the Actes de la Recherche en
Sciences Sociales in 1980. Because it was in French, the article did not
garner widespread attention in the English-speaking world; nor, for that matter,
did the first English translation, concealed in the pages of a text on the
sociology of education (Bourdieu 1985).

This lack of visibility is lamentable because Bourdieu's analysis is arguably 
the most theoretically refined among those that introduced the term in contem- 
porary sociological discourse. His treatment of the concept is instrumental, fo- 
cusing on the benefits accruing to individuals by virtue of participation in 
groups and on the deliberate construction of sociability for the purpose of cre- 
ating this resource. In the original version, he went as far as asserting that "the 
profits which accrue from membership in a group are the basis of the solidarity 
which makes them possible" (Bourdieu 1985, p. 249). Social networks are not 
a natural given and must be constructed through investment strategies oriented 
to the institutionalization of group relations, usable as a reliable source of other 
benefits. Bourdieu's definition makes clear that social capital is decomposable 
into two elements: first, the social relationship itself that allows individuals to 
claim access to resources possessed by their associates, and second, the
amount and quality of those resources. 

Throughout, Bourdieu's emphasis is on the fungibility of different forms of 
capital and on the ultimate reduction of all forms to economic capital, defined 
as accumulated human labor. Hence, through social capital, actors can gain di- 
rect access to economic resources (subsidized loans, investment tips, protected 
markets); they can increase their cultural capital through contacts with experts 
or individuals of refinement (i.e. embodied cultural capital); or, alternatively, 
they can affiliate with institutions that confer valued credentials (i.e. institu- 
tionalized cultural capital). 

On the other hand, the acquisition of social capital requires deliberate invest- 
ment of both economic and cultural resources. Though Bourdieu insists that the 
outcomes of possession of social or cultural capital are reducible to economic 
capital, the processes that bring about these alternative forms are not. They 
each possess their own dynamics, and, relative to economic exchange, they are 
characterized by less transparency and more uncertainty. For example, trans- 
actions involving social capital tend to be characterized by unspecified obliga- 
tions, uncertain time horizons, and the possible violation of reciprocity expec- 
tations. But, by their very lack of clarity, these transactions can help disguise 
what otherwise would be plain market exchanges (Bourdieu 1979, 1980). 

A second contemporary source is the work of economist Glenn Loury (1977,
1981). He came upon the term in the context of his critique of neoclassical 
theories of racial income inequality and their policy implications. Loury ar- 
gued that orthodox economic theories were too individualistic, focusing exclu- 
sively on individual human capital and on the creation of a level field for com- 
petition based on such skills. By themselves, legal prohibitions against em- 
ployers' racial tastes and implementation of equal opportunity programs 
would not reduce racial inequalities. The latter could go on forever, according 
to Loury, for two reasons—first, the inherited poverty of black parents, which 
would be transmitted to their children in the form of lower material resources 
and educational opportunities; second, the poorer connections of young black 
workers to the labor market and their lack of information about opportunities: 


The merit notion that, in a free society, each individual will rise to the level
justified by his or her competence conflicts with the observation that no one
travels that road entirely alone. The social context within which individual
maturation occurs strongly conditions what otherwise equally competent
individuals can achieve. This implies that absolute equality of opportunity, ...
is an ideal that cannot be achieved. (Loury 1977, p. 176)


Loury cited with approval the sociological literature on intergenerational 
mobility and inheritance of race as illustrating his anti-individualist argument. 
However, he did not go on to develop the concept of social capital in any detail. 
He seems to have run across the idea in the context of his polemic against
orthodox labor economics, but he mentions it only once in his original article and
then in rather tentative terms (Loury 1977). The concept captured the
differential access to opportunities through social connections for minority and
nonminority youth, but we do not find here any systematic treatment of its relations
to other forms of capital. 

Loury's work paved the way, however, for Coleman's more refined analy- 
sis of the same process, namely the role of social capital in the creation of hu- 
man capital. In his initial analysis of the concept, Coleman acknowledges 
Loury's contribution as well as those of economist Ben-Porath and sociolo- 
gists Nan Lin and Mark Granovetter. Curiously, Coleman does not mention 
Bourdieu, although his analysis of the possible uses of social capital for the ac- 
quisition of educational credentials closely parallels that pioneered by the 
French sociologist.¹ Coleman defined social capital by its function as “a
variety of entities with two elements in common: They all consist of some aspect
of social structures, and they facilitate certain action of actors—whether
persons or corporate actors—within the structure” (Coleman 1988a: p. S98; 1990,
p. 302).

This rather vague definition opened the way for relabeling a number of dif- 
ferent and even contradictory processes as social capital. Coleman himself 
started that proliferation by including under the term some of the mechanisms 
that generated social capital (such as reciprocity expectations and group en- 
forcement of norms); the consequences of its possession (such as privileged 
access to information); and the "appropriable" social organization that pro- 
vided the context for both sources and effects to materialize. Resources ob- 
tained through social capital have, from the point of view of the recipient, the 
character of a gift. Thus, it is important to distinguish the resources themselves 
from the ability to obtain them by virtue of membership in different social 
structures, a distinction explicit in Bourdieu but obscured in Coleman.
Equating social capital with the resources acquired through it can easily lead to
tautological statements.²

¹The closest equivalent to human capital in Bourdieu’s analysis is embodied cultural capital,
which is defined as the habitus of cultural practices, knowledge, and demeanors learned through
exposure to role models in the family and other environments (Bourdieu 1979).
²Saying, for example, that student A has social capital because he obtained access to a large
tuition loan from his kin and that student B does not because she failed to do so neglects the
possibility that B's kin network is equally or more motivated to come to her aid but simply lacks the
means to do so. Defining social capital as equivalent with the resources thus obtained is tantamount to
saying that the successful succeed. This circularity is more evident in applications of social capital
that define it as a property of collectivities. These are reviewed below.

Equally important is the distinction between the motivations of recipients
and of donors in exchanges mediated by social capital. Recipients’ desire to
gain access to valuable assets is readily understandable. More complex are the
motivations of the donors, who are requested to make these assets available 
without any immediate return. Such motivations are plural and deserve analy- 
sis because they are the core processes that the concept of social capital seeks 
to capture. Thus, a systematic treatment of the concept must distinguish 
among: (a) the possessors of social capital (those making claims); (b) the
sources of social capital (those agreeing to these demands); (c) the resources 
themselves. These three elements are often mixed in discussions of the concept 
following Coleman, thus setting the stage for confusion in the uses and scope 
of the term. 

Despite these limitations, Coleman's essays have the undeniable merit of 
introducing and giving visibility to the concept in American sociology, high- 
lighting its importance for the acquisition of human capital, and identifying 
some of the mechanisms through which it is generated. In this last respect, his 
discussion of closure is particularly enlightening. Closure means the existence 
of sufficient ties between a certain number of people to guarantee the obser- 
vance of norms. For example, the possibility of malfeasance within the tightly 
knit community of Jewish diamond traders in New York City is minimized by 
the dense ties among its members and the ready threat of ostracism against vio- 
lators. The existence of such a strong norm is then appropriable by all members 
of the community, facilitating transactions without recourse to cumbersome 
legal contracts (Coleman 1988a:S99). 

After Bourdieu, Loury, and Coleman, a number of theoretical analyses of 
social capital have been published. In 1990, WE Baker defined the concept as 
“a resource that actors derive from specific social structures and then use to 
pursue their interests; it is created by changes in the relationship among actors" 
(Baker 1990, p. 619). More broadly, M Schiff defines the term as "the set of 
elements of the social structure that affects relations among people and are in- 
puts or arguments of the production and/or utility function" (Schiff 1992, p. 
161). Burt sees it as "friends, colleagues, and more general contacts through 
whom you receive opportunities to use your financial and human capital" 
(Burt 1992, p. 9). Whereas Coleman and Loury had emphasized dense
networks as a necessary condition for the emergence of social capital, Burt
highlights the opposite situation. In his view, it is the relative absence of ties,
labeled “structural holes,” that facilitates individual mobility. This is so because
dense networks tend to convey redundant information, while weaker ties can 
be sources of new knowledge and resources. 

Despite these differences, the consensus is growing in the literature that so- 
cial capital stands for the ability of actors to secure benefits by virtue of mem- 
bership in social networks or other social structures. This is the sense in which 
it has been more commonly applied in the empirical literature although, as we 
will see, the potential uses to which it is put vary greatly. 
Sources of Social Capital


Both Bourdieu and Coleman emphasize the intangible character of social capital 
relative to other forms. Whereas economic capital is in people's bank accounts 
and human capital is inside their heads, social capital inheres in the structure of 
their relationships. To possess social capital, a person must be related to others, 
and it is those others, not himself or herself, who are the actual source of his or her
advantage. As mentioned before, the motivation of others to make resources
available on concessionary terms is not uniform. At the broadest level, one may
distinguish between consummatory and instrumental motivations to do so.

As examples of the first, people may pay their debts in time, give alms to 
charity, and obey traffic rules because they feel an obligation to behave in this 
manner. The internalized norms that make such behaviors possible are then ap- 
propriable by others as a resource. In this instance, the holders of social capital 
are other members of the community who can extend loans without fear of 
nonpayment, benefit from private charity, or send their kids to play in the street 
without concern. Coleman (1988a: S104) refers to this source in his analysis of
norms and sanctions: “Effective norms that inhibit crime make it possible to
walk freely outside at night in a city and enable old persons to leave their
houses without fear for their safety.” As is well known, an excessive emphasis
on this process of norm internalization led to the oversocialized conception of 
human action in sociology so trenchantly criticized by Wrong (1961). 

An approach closer to the undersocialized view of human nature in modern 
economics sees social capital as primarily the accumulation of obligations 
from others according to the norm of reciprocity. In this version, donors pro- 
vide privileged access to resources in the expectation that they will be fully re- 
paid in the future. This accumulation of social chits differs from purely eco- 
nomic exchange in two aspects. First, the currency with which obligations are 
repaid may be different from that with which they were incurred in the first 
place and may be as intangible as the granting of approval or allegiance. Sec- 
ond, the timing of the repayment is unspecified. Indeed, if a schedule of repay- 
ments exists, the transaction is more appropriately defined as market exchange 
than as one mediated by social capital. This instrumental treatment of the term 
is quite familiar in sociology, dating back to the classical analysis of social ex- 
change by Simmel ([1902a] 1964), the more recent ones by Homans (1961) 
and Blau (1964), and extensive work on the sources and dynamics of reciproc- 
ity by authors of the rational action school (Schiff 1992, Coleman 1994). 

Two other sources of social capital exist that fit the consummatory versus 
instrumental dichotomy, but in a different way. The first finds its theoretical 
underpinnings in Marx's analysis of emergent class consciousness in the in- 
dustrial proletariat. By being thrown together in a common situation, workers 
learn to identify with each other and support each other's initiatives. This
solidarity is not the result of norm introjection during childhood, but is an
emergent product of a common fate (Marx [1894] 1967, Marx & Engels [1848]
1947). For this reason, the altruistic dispositions of actors in these situations 
are not universal but are bounded by the limits of their community. Other 
members of the same community can then appropriate such dispositions and 
the actions that follow as their source of social capital. 

Bounded solidarity is the term used in the recent literature to refer to this 
mechanism. It is the source of social capital that leads wealthy members of a 
church to anonymously endow church schools and hospitals; members of a 
suppressed nationality to voluntarily join life-threatening military activities in 
its defense; and industrial proletarians to take part in protest marches or sym- 
pathy strikes in support of their fellows. Identification with one's own group, 
sect, or community can be a powerful motivational force. Coleman refers to 
extreme forms of this mechanism as “zeal” and defines them as an effective
antidote to free-riding by others in collective movements (Coleman 1990, pp.
273–82; Portes & Sensenbrenner 1993).

The final source of social capital finds its classical roots in Durkheim's
([1893] 1984) theory of social integration and the sanctioning capacity of
group rituals. As in the case of reciprocity exchanges, the motivation of donors
of socially mediated gifts is instrumental, but in this case, the expectation of
repayment is not based on knowledge of the recipient, but on the insertion of
both actors in a common social structure. The embedding of a transaction into
such structure has two consequences. First, the donor's returns may come not
directly from the recipient but from the collectivity as a whole in the form of
status, honor, or approval. Second, the collectivity itself acts as guarantor that
whatever debts are incurred will be repaid.


Sources:
    Consummatory
        - Value Introjection
        - Bounded Solidarity
    Instrumental
        - Reciprocity Exchanges
        - Enforceable Trust

Definition:
    Ability to Secure Benefits through Membership in Networks and other
    Social Structures

Consequences:
    - Norm Observance (Social Control)
    - Family Support
    - Network-mediated Benefits
    - Restricted Access to Opportunities
    - Restrictions on Individual Freedom
    - Excessive Claims on Group Members
    - Downward Leveling Norms


Figure 1 Actual and potential gains and losses in transactions mediated by social capital

As an example of the first consequence, a member of an ethnic group may 
endow a scholarship for young co-ethnic students, thereby expecting not re- 
payment from recipients but rather approval and status in the collectivity. The 
students' social capital is not contingent on direct knowledge of their
benefactor, but on membership in the same group. As an example of the second effect,
a banker may extend a loan without collateral to a member of the same
religious community in full expectation of repayment because of the threat of
community sanctions and ostracism. In other words, trust exists in this situation
precisely because obligations are enforceable, not through recourse to law or 
violence but through the power of the community. 

In practice, these two effects of enforceable trust are commonly mixed, as 
when someone extends a favor to a fellow member in expectation of both 
guaranteed repayment and group approval. As a source of social capital, en- 
forceable trust is hence appropriable by both donors and recipients: For recipi- 
ents, it obviously facilitates access to resources; for donors, it yields approval 
and expedites transactions because it ensures against malfeasance. No lawyer 
need apply for business transactions underwritten by this source of social capi- 
tal. The left side of Figure 1 summarizes the discussion in this section. Keeping 
these distinctions in mind is important to avoid confusing consummatory and 
instrumental motivations or mixing simple dyadic exchanges with those em- 
bedded in larger social structures that guarantee their predictability and course. 


Effects of Social Capital: Recent Research 


Just as the sources of social capital are plural, so are its consequences. The
empirical literature includes applications of the concept as a predictor of, among
others, school attrition and academic performance, children's intellectual
development, sources of employment and occupational attainment, juvenile
delinquency and its prevention, and immigrant and ethnic enterprise.³ Diversity
of effects goes beyond the broad set of specific dependent variables to which 
social capital has been applied to encompass, in addition, the character and 
meaning of the expected consequences. A review of the literature makes it
possible to distinguish three basic functions of social capital, applicable in a
variety of contexts: (a) as a source of social control; (b) as a source of family
support; (c) as a source of benefits through extrafamilial networks.


³The following review does not aim at an exhaustive coverage of the empirical literature. That
task has been rendered obsolete by the advent of computerized topical searches. My purpose
instead is to document the principal types of application of the concept in the literature and to
highlight their interrelationships.
As examples of the first function, we find a series of studies that focus on
rule enforcement. The social capital created by tight community networks is 
useful to parents, teachers, and police authorities as they seek to maintain dis- 
cipline and promote compliance among those under their charge. Sources of 
this type of social capital are commonly found in bounded solidarity and
enforceable trust, and its main result is to render formal or overt controls
unnecessary. The process is exemplified by Zhou & Bankston's study of the tightly
knit Vietnamese community of New Orleans: 


Both parents and children are constantly observed as under a “Vietnamese 
microscope." If a child flunks out or drops out of a school, or if a boy falls 
into a gang or a girl becomes pregnant without getting married, he or she 
brings shame not only to himself or herself but also to the family. (Zhou & 
Bankston 1996, p. 207) 


The same function is apparent in Hagan et al's (1995) analysis of right-wing 
extremism among East German youth. Labeling right-wing extremism a sub- 
terranean tradition in German society, these authors seek to explain the rise of 
that ideology, commonly accompanied by anomic wealth aspirations among 
German adolescents. These tendencies are particularly strong among those 
from the formerly communist eastern states. That trend is explained as the joint 
outcome of the removal of social controls (low social capital), coupled with the 
long deprivations endured by East Germans. Incorporation into the West has 
brought about new uncertainties and the loosening of social integration, thus 
allowing German subterranean cultural traditions to re-emerge. 

Social control is also the focus of several earlier essays by Coleman, who 
laments the disappearance of those informal family and community structures 
that produced this type of social capital; Coleman calls for the creation of for- 
mal institutions to take their place. This was the thrust of Coleman's 1992 
presidential address to the American Sociological Association, in which he 
traced the decline of "primordial" institutions based on the family and their re- 
placement by purposively constructed organizations. In his view, modern soci- 
ology's task is to guide this process of social engineering that will substitute 
obsolete forms of control based on primordial ties with rationally devised ma- 
terial and status incentives (Coleman 1988b, 1993). The function of social 
capital for social control is also evident whenever the concept is discussed in 
conjunction with the law (Smart 1993, Weede 1992). It is as well the central 
focus when it is defined as a property of collectivities such as cities or nations. 
This latter approach, associated mainly with the writings of political scientists, 
is discussed in a following section. 

The influence of Coleman's writings is also clear in the second function of 
social capital, namely as a source of parental and kin support. Intact families 
and those where one parent has the primary task of rearing children possess 
more of this form of social capital than do single-parent families or those
where both parents work. The primary beneficiaries of this resource are, of 
course, the children whose education and personality development are en- 
riched accordingly. Coleman (1988a:S110) thus cites approvingly the practice 
of Asian immigrant mothers who not only stay at home but often purchase sec- 
ond copies of school textbooks to help their offspring with their homework. 

A second example of this function is in McLanahan & Sandefur's mono- 
graph Growing Up with a Single Parent (1994), which examines the conse- 
quences of single parenthood for school achievement and attrition, teenage 
pregnancy, and other adolescent outcomes. Social capital tends to be lower for 
children in single-parent families because they lack the benefit of a second at- 
home parent and because they tend to change residences more often, leading to 
fewer ties to other adults in the community. This deficit is not the only causal 
factor but certainly plays an important role in bringing about less desirable 
educational and personality outcomes among single-parent children. Along 
the same lines, Parcel & Menaghan (1994a,b) have conducted extensive quan- 
titative analyses of national surveys to examine the effect of parental work on 
children's cognitive and social development. They conclude that parental in- 
tellectual and other resources contribute to the forms of family capital useful in 
facilitating positive children outcomes, but that common beliefs about a nega- 
tive effect of maternal work during early infancy are overgeneralized. 

A third example is Hao's (1994) analysis of kin support and out-of-wedlock 
motherhood. Like financial capital, social capital influences transfers made by 
parents to daughters and behavioral outcomes such as teen pregnancy,
educational attainment, and labor force participation. Social capital is greater in
two-parent families, those with fewer children, and those where parents have high
aspirations for their young. These conditions foster greater parental attention, 
more hours spent with children, and the emergence of an achievement orienta- 
tion among adolescents. 

Two interesting final examples highlight the role of family support as a 
counterweight to the loss of community bonds. In their longitudinal study of
adolescents in Toronto, Hagan et al (1996) confirm Coleman's finding about 
the deleterious effect of multiple family moves on children's emotional adjust- 
ment and educational achievement. Leaving a community tends to destroy es- 
tablished bonds, thus depriving family and children of a major source of social 
capital. These authors find, however, an interaction effect leading to an exacer- 
bation of the loss among children whose parents provide them with weak sup- 
port and to a partial neutralization among those in the opposite situation. Pa- 
rental support leads to higher educational achievement, both directly and indi- 
rectly through compensating for the loss of community among migrants. 
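
The interaction effect described above can be written schematically as a regression with a product term. This is only an illustrative specification under assumed notation, not the model actually estimated by Hagan et al: $Y_i$ stands for a child's educational achievement, $M_i$ for the number of family moves, and $S_i$ for the level of parental support.

```latex
Y_i = \beta_0 + \beta_1 M_i + \beta_2 S_i + \beta_3 (M_i \times S_i) + \varepsilon_i,
\qquad
\frac{\partial Y_i}{\partial M_i} = \beta_1 + \beta_3 S_i
```

Under this sketch, the reported pattern corresponds to $\beta_1 < 0$, $\beta_2 > 0$, and $\beta_3 > 0$: the marginal effect of a move is most negative when parental support is weak and moves toward zero as support increases.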

Along the same lines, Gold (1995) highlights the change in parental roles 
among Israeli immigrant families in the United States. In Israel, close community
bonds facilitate supervision and rearing of children because other adults
know the young and assume responsibility for their well-being. In the more 
anomic American environment, mothers are assigned the role of compensating 
for the lack of community ties with exclusive dedication to their children. 
Thus, female labor force participation is much greater in Israel than among Is- 
raelis in the United States as mothers endeavor to preserve an appropriate cul- 
tural environment for their young. Note that in both of these examples, reduc- 
tion of social capital in its first form—community social bonds and control—is 
partially compensated by an increase of social capital in its second form, famil- 
ial support. 

By far, however, the most common function attributed to social capital is as 
a source of network-mediated benefits beyond the immediate family. This 
definition comes closest to that of Bourdieu (1979, 1980), for whom parental 
support of children's development is a source of cultural capital, while social 
capital refers to assets gained through membership in networks. This third 
function is illustrated by Anheier et al’s (1995) use of blockmodeling techniques
to map social ties among artists and intellectuals in the German city of
Cologne. Results of their analysis show very strong networks among core 
members of the city's intellectual elite along with more restricted access to 
them for those in peripheral and commercial pursuits. From a methodological 
standpoint, this article is one of the most sophisticated applications of Bour- 
dieu's ideas to the sociology of culture. 

Yet, the most common use of this third form of social capital is in the field 
of stratification. It is frequently invoked here as an explanation of access to 
employment, mobility through occupational ladders, and entrepreneurial success.
The idea that connections are instrumental in furthering individual mobility
is central to Loury's analysis, as seen previously, and is also found
among a number of authors who do not conceptualize it explicitly as social 
capital. Granovetter (1974), for example, coined the term "strength of weak 
ties" to refer to the power of indirect influences outside the immediate circle of 
family and close friends to serve as an informal employment referral system. 
The idea was original because it ran contrary to the commonsense notion that 
dense networks such as those available through family circles would be most 
effective in finding jobs. Almost two decades later, Burt (1992) built on Granovetter's
insight by developing the concept of “structural holes.” As we have
seen, Burt did employ the term social capital and, like Bourdieu's, his defini- 
tion is instrumental. In Burt’s case, however, social capital is based on the rela- 
tive paucity of network ties rather than on their density. 

Another noteworthy early effort was by Nan Lin, Walter Ensel, and John C 
Vaughn (1981), Social Resources and Strength of Ties, which points precisely 
in the opposite direction. Although Lin and his colleagues did not use the term 
social capital, Coleman (1988a) cites their work approvingly because of a
common emphasis on dense networks as a resource. This alternative stance,
which, in contrast to Granovetter and Burt, may be labeled “the strength of
strong ties,” is also evident in other areas of the social-networks-and-mobility
literature. One of the most noteworthy is the study of immigrant and ethnic entrepreneurship,
in which networks and the social capital that flows through
them are consistently identified as a key resource for the creation of small busi- 
nesses. Light, for example, has emphasized the importance of rotating credit 
associations (RCAs) for the capitalization of Asian immigrant firms in the 
United States. RCAs are informal groups that meet periodically, with every 
member contributing a set amount to a common pool that is received by each 
in turn. Social capital in this case comes from the trust that each participant has 
in the continuing contribution of others even after they receive the pooled 
funds. Without such trust, no one will contribute and each will be deprived of 
this effective means to gain access to finance (Light 1984, Light & Bonacich 
1988). 
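
The mechanics of a rotating credit association, and its dependence on trust, can be sketched in a few lines of Python. This is a toy model for illustration only and is not drawn from Light's studies: the function name `simulate_rca`, the member labels, the contribution amount, and the way defection is modeled (a member stops paying in once he or she has received the pool) are all assumptions.

```python
def simulate_rca(members, contribution, defectors=frozenset()):
    """Run one full cycle of a toy rotating credit association.

    members: ordered list of names; each receives the pool once, in turn.
    contribution: fixed amount each member pays in at every meeting.
    defectors: hypothetical members who stop contributing once they
               have received the pool (a stand-in for broken trust).
    Returns a dict mapping each member to the pool he or she received.
    """
    paid_out = {}
    for recipient in members:
        pool = 0
        for m in members:
            # A defector contributes only until he or she has been paid.
            if m in defectors and m in paid_out:
                continue
            pool += contribution
        paid_out[recipient] = pool
    return paid_out


members = ["A", "B", "C", "D"]
full_trust = simulate_rca(members, 100)
one_defector = simulate_rca(members, 100, defectors={"A"})
```

In this sketch, a single early defector keeps the full pool while every later recipient gets less, which is the incentive problem that trust among participants has to overcome.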

The role of social networks is equally important in studies of ethnic busi- 
ness enclaves and ethnic niches. Enclaves are dense concentrations of immi- 
grant or ethnic firms that employ a significant proportion of their co-ethnic la- 
bor force and develop a distinctive physical presence in urban space. Studies of 
New York's Chinatown (Zhou 1992); of Miami's Little Havana (Portes 1987, 
Portes & Stepick 1993, Perez 1992); and of Los Angeles' Koreatown (Light & 
Bonacich 1988, Nee et al 1994) consistently highlight the role of community 
networks as a source of vital resources for these ethnic firms. Such resources 
include but are not limited to start-up capital; others are tips about business op- 
portunities, access to markets, and a pliant and disciplined labor force. 

Ethnic niches emerge when a group is able to colonize a particular sector of 
employment in such a way that members have privileged access to new job 
openings, while restricting that of outsiders. Examples documented in the lit- 
erature range from restaurant work and garment factories all the way to police 
and fire departments and certain branches of the New York and Miami civil 
services (Waters 1994, Doeringer & Moss 1986, Bailey & Waldinger 1991, 
Waldinger 1996, Stepick 1989). As in the case of enclaves, mobility opportunities
through niches are entirely network-driven. Members find jobs for others,
teach them the necessary skills, and supervise their performance. The
power of network chains is such that entry level openings are frequently filled 
by contacting kin and friends in remote foreign locations rather than by tap- 
ping other available local workers (Sassen 1995). 

The opposite of this situation is the dearth of social connections in certain 
impoverished communities or their truncated character. Since publication of 
Carol Stack’s All Our Kin (1974), sociologists know that everyday survival in
poor urban communities frequently depends on close interaction with kin and 
friends in similar situations. The problem is that such ties seldom reach beyond
the inner city, thus depriving their inhabitants of sources of information about
employment opportunities elsewhere and ways to attain them. Wacquant & 
Wilson (1989) and Wilson (1987, 1996) also emphasize how the departure of 
both industrial employment and middle-class families from black inner city 
areas has left the remaining population bereft of social capital, a situation
leading to its extremely high levels of unemployment and welfare depend- 
ency. 

The same point is central to Mercer Sullivan's (1989) comparative ethno- 
graphies of Puerto Rican, black, and working-class white youth in three New 
York communities. Sullivan challenges blanket assertions about youth subcul- 
tures as determinants of deviant behavior by showing that access to regular 
jobs and participation in deviant activities are both network mediated. As Granovetter
(1974) had noted earlier, teenagers seldom find jobs; instead jobs
come to them through the mediation of parents and other adults in their imme- 
diate community. Sullivan shows how such networks are much feebler in the 
case of black youth because of the scarcity of occupants of influential positions 
in the adult generation. Thrown back on their own resources, black adolescents 
are seldom able to compete successfully for good regular jobs; thus they be- 
come available for alternative forms of income earning. 

In her analysis of teenage pregnancy in Baltimore's ghetto, Fernandez- 
Kelly (1995) notes how the dense but truncated networks of inner-city black 
families not only cut off members from information about the outside world, 
but simultaneously support alternative cultural styles that make access to 
mainstream employment even more difficult. In this isolated context, teenage 
pregnancy is not the outgrowth of carelessness or excess sexuality but, more 
commonly, a deliberate means to gain adult status and a measure of independ- 
ence. 

Similarly, Stanton-Salazar & Dornbusch (1995) have investigated the relationship
between outside social networks and academic achievement and aspirations
among Mexican high school students in the San Francisco area. They
find positive correlations among these variables, although the strongest asso- 
ciations are with bilingualism, suggesting the role of cultural capital in status 
attainment. In a related article, Valenzuela & Dornbusch (1994) highlight the
role of family networks and a familistic orientation in the academic achieve- 
ment of Mexican-origin students. Paralleling the studies of Hagan et al (1996) 
and Gold (1995), these articles suggest that immigrant families compensate for 
the absence of the third form of social capital—outside networks—with an em- 
phasis on social capital in the form of familial support, including preservation 
of the cultural orientations of their home country. 

As in the case of the various sources of social capital outlined in the last sec- 
tion, it is also important to keep in mind the differing functions of the concept 
both to avoid confusion and to facilitate study of their interrelationships. It is
possible, for example, that social capital in the form of social control may clash
with social capital in the form of network-mediated benefits, if the latter consists
precisely in the ability to bypass existing norms. The capacity of authorities
to enforce rules (social control) can thus be jeopardized by the existence of
tight networks whose function is precisely to facilitate violation of those rules
for private benefit. These paradoxical outcomes point to the need for a closer
look at the actual and potential gainers and losers in transactions mediated by 
social capital. The right side of Figure 1 summarizes the previous discussion 
and that of the next section. 


Negative Social Capital⁴


The research literature on social capital strongly emphasizes its positive con- 
sequences. Indeed it is our sociological bias to see good things emerging out of 
sociability; bad things are more commonly associated with the behavior of 
homo economicus. However, the same mechanisms appropriable by individu- 
als and groups as social capital can have other, less desirable consequences. It 
is important to emphasize them for two reasons: first, to avoid the trap of pre- 
senting community networks, social control, and collective sanctions as un- 
mixed blessings; second, to keep the analysis within the bounds of serious so- 
ciological analysis rather than moralizing statements. Recent studies have 
identified at least four negative consequences of social capital: exclusion of 
outsiders, excess claims on group members, restrictions on individual free- 
doms, and downward leveling norms. I summarize them next. 

First, the same strong ties that bring benefits to members of a group com- 
monly enable it to bar others from access. Waldinger (1995) describes the tight 
control exercised by white ethnics—descendants of Italian, Irish, and Polish 
immigrants—over the construction trades and the fire and police unions of 
New York. Other cases include the growing control of the produce business by 
Korean immigrants in several East Coast cities, the traditional monopoly of 
Jewish merchants over the New York diamond trade, and the dominance of 
Cubans over numerous sectors of the Miami economy. In each instance, social 
capital generated by bounded solidarity and trust is at the core of the group's
economic advance. But, as Waldinger (1995, p. 557) points out, “the same social
relations that...enhance the ease and efficiency of economic exchanges
among community members implicitly restrict outsiders.”

Ethnic groups are not the only ones that use social capital for economic ad- 
vantage. Two centuries ago, Adam Smith ([1776] 1979, p. 232) complained 
that meetings of merchants inevitably ended up as a conspiracy against the 
public. The public, of course, are all those excluded from the networks and mutual
knowledge linking the colluding groups. Substitute for “merchants” white
building contractors, ethnic union bosses, or immigrant entrepreneurs, and the
contemporary relevance of Smith's point becomes evident.

⁴This section is partially based on Portes & Sensenbrenner (1993) and Portes & Landolt (1996).

The second negative effect of social capital is the obverse of the first be- 
cause group or community closure may, under certain circumstances, prevent 
the success of business initiatives by their members. In his study of the rise of 
commercial enterprises in Bali, Geertz observed how successful entrepreneurs 
were constantly assaulted by job and loan-seeking kinsmen. These claims 
were buttressed by strong norms enjoining mutual assistance within the ex- 
tended family and among community members in general (Geertz 1963). The 
result was to turn promising enterprises into welfare hotels, checking their eco- 
nomic expansion. 

Granovetter (1995), who calls attention to this example, notes that it is an 
instance of the problem that classic economic development theory identified 
among traditional enterprises. Weber ([1922] 1965) made the same point when 
he stressed the importance of impersonal economic transactions guided by the 
principle of universalism as one of the major reasons for Puritan entrepreneu- 
rial success. Thus, cozy intergroup relations of the kind found in highly soli- 
dary communities can give rise to a gigantic free-riding problem, as less dili- 
gent members enforce on the more successful all kinds of demands backed by a 
shared normative structure. For claimants, their social capital consists pre- 
cisely of privileged access to the resources of fellow members. In the process, 
opportunities for entrepreneurial accumulation and success are dissipated.⁵

Third, community or group participation necessarily creates demands for 
conformity. In a small town or village, all neighbors know each other, one can 
get supplies on credit at the corner store, and children play freely in the streets 
under the watchful eyes of other adults. The level of social control in such settings
is strong and also quite restrictive of personal freedoms, which is the reason
why the young and the more independent-minded have always left. Boissevain
(1974) reports such a situation in his study of village life on the island of
Malta. Dense, “multiplex”⁶ networks tying inhabitants together created the
ground for an intense community life and strong enforcement of local norms.
The privacy and autonomy of individuals were reduced accordingly.

⁵A related problem has been observed in inner city neighborhoods where kin networks form a
key survival resource through mutual assistance and ready access to favors and small loans. By the
same token, the norm that dictates that incoming resources (such as a money prize) be shared with
relatives and friends effectively prevents any sustained accumulation or entrepreneurial investment
by individuals. Those wishing to pursue that route must distance themselves from their former
partners (see Uehara 1990, Fernandez-Kelly 1995, Stack 1974).

⁶Multiplexity refers to overlapping social networks where the same people are linked together
across different roles. In small towns, for example, the same individuals may be simultaneously
kin, neighbors, and co-workers, thus increasing the intensity of their ties and the capacity for
mutual monitoring (Boissevain 1974, pp. 31-33).

This is an expression of the age-old dilemma between community solidarity 
and individual freedom analyzed by Simmel ([1902] 1964) in his classic essay 
on “The Metropolis and Mental Life.” In that essay, Simmel came out in favor
of personal autonomy and responsibility. At present, the pendulum has swung 
back, and a number of authors are calling for stronger community networks 
and norm observance in order to re-establish social control. This may be desir- 
able in many instances, but the downside of this function of social capital must 
also be kept in mind. 

Constraints on individual freedom may be responsible for Rumbaut's find- 
ings that high levels of familistic solidarity among recent immigrant students 
are negatively related to four different educational outcomes, including grades 
and standardized test scores. According to this author, “family ties bind, but 
sometimes these bonds constrain rather than facilitate particular outcomes" 
(Rumbaut 1977, p. 39). 

Fourth, there are situations in which group solidarity is cemented by a com- 
mon experience of adversity and opposition to mainstream society. In these in- 
stances, individual success stories undermine group cohesion because the lat- 
ter is precisely grounded on the alleged impossibility of such occurrences. The 
result is downward leveling norms that operate to keep members of a down- 
trodden group in place and force the more ambitious to escape from it. In his 
ethnographic research among Puerto Rican crack dealers in the Bronx, Bour- 
gois (1991, 1995) calls attention to the local version of this process, which sin- 
gles out for attack individuals seeking to join the middle-class mainstream. He 
reports the views of one of his informants: 


When you see someone go downtown and get a good job, if they be Puerto
Rican, you see them fix up their hair and put some contact lenses in their
eyes. Then they fit in and they do it! I have seen it!... Look at all the people in 
that building, they all "turn-overs." They people who want to be white. Man, 
if you call them in Spanish it wind up a problem. I mean like take the name 
Pedro—I'm just telling you this as an example—Pedro be saying (imitating a
whitened accent) "My name is Peter." Where do you get Peter from Pedro? 
(Bourgois 1991, p. 32) 


Similar examples are reported by Stepick (1992) in his study of Haitian-American
youth in Miami and by Suarez-Orozco (1987) and Matute-Bianchi
(1986, 1991) among Mexican-American teenagers in Southern California. In 
each instance, the emergence of downward leveling norms has been preceded
by lengthy periods, often lasting generations, in which the mobility of a particular
group has been blocked by outside discrimination. That historical experience
underlies the emergence of an oppositional stance toward the mainstream
and a solidarity grounded in a common experience of subordination.
Once in place, however, this normative outlook has the effect of helping perpetuate
the very situation that it decries.

Notice that social capital, in the form of social control, is still present in 
these situations, but its effects are exactly the opposite of those commonly 
celebrated in the literature. Whereas bounded solidarity and trust provide the 
sources for socioeconomic ascent and entrepreneurial development among 
some groups, among others they have exactly the opposite effect. Sociability 
cuts both ways. While it can be the source of public goods, such as those cele- 
brated by Coleman, Loury, and others, it can also lead to public “bads.” Mafia 
families, prostitution and gambling rings, and youth gangs offer so many ex- 
amples of how embeddedness in social structures can be turned to less than so- 
cially desirable ends. The point is particularly important as we turn to the more 
recent and more celebratory versions of social capital. 


Social Capital as a Feature of Communities and Nations⁷


As seen in previous sections, sociological analyses of social capital have been 
grounded on relationships between actors or between an individual actor and a 
group. Throughout, the focus has been on the potential benefit accruing to ac- 
tors because of their insertion into networks or broader social structures. An in- 
teresting conceptual twist was introduced by political scientists who equate so- 
cial capital with the level of “civicness” in communities such as towns, cities, 
or even entire countries. For Robert Putnam, the most prominent advocate of 
this approach, social capital means "features of social organizations, such as 
networks, norms, and trust, that facilitate action and cooperation for mutual 
benefit." The collective character of this version of the concept is evident in 
the next sentence: "Working together is easier in a community blessed with a 
substantial stock of social capital" (Putnam 1993, pp. 35-36).

In practice, this stock is equated with the level of associational involvement 
and participatory behavior in a community and is measured by such indicators
as newspaper reading, membership in voluntary associations, and expressions 
of trust in political authorities. Putnam is not shy about the expected reach and 
significance of this version of social capital: 

This insight turns out to have powerful practical implications for many issues
on the American national agenda—for how we might overcome the poverty
and violence of South Central Los Angeles...or nurture the fledgling democracies
of the former Soviet empire. (Putnam 1993, p. 36; 1996)


⁷This section is partially based on Portes & Landolt (1996).

The prospect of a simple diagnosis of the country’s problems and a ready
solution to them has attracted widespread public attention. Putnam’s article,
“Bowling Alone: America’s Declining Social Capital,” published in the Journal
of Democracy in 1995, created something of a sensation, earning for its
author a tête-à-tête with President Clinton and a profile in People magazine. The
nostalgic image evoked by the lonely bowler resonated with many powerful 
members of the American establishment and even inspired passages in Clinton's
State of the Union address in 1995 (Pollitt 1996, Lemann 1996). Putnam but- 
tressed his case with figures about rapidly declining levels of voting and mem- 
bership in such organizations as the PTA, the Elks Club, the League of Women 
Voters, and the Red Cross. He then identified the immediate determinant of the 
decreasing national stock of social capital, namely the passage from the scene of 
the civic generation active during the 1920s and 1930s and the succession of an 
uncivic generation, the baby boomers, born and raised after World War II:


...the very decades that have seen a national deterioration in social capital
are the same decades during which the numerical dominance of a trusting 
and civic generation has been replaced by this domination of post-civic co- 
horts... Thus a generational analysis leads almost inevitably to the conclu- 
sion that the national slump in trust and engagement is likely to continue. 
(Putnam 1996, pp. 45-46)


Critics have focused on the question of whether voluntarism and civic spirit 
have actually declined in America and on the unacknowledged class bias in 
Putnam's thesis. Lay reviewers such as Lemann in The Atlantic Monthly and 
Pollitt in The Nation questioned whether American civic virtue is on the wane 
or has simply taken new forms different from the old-style organizations cited 
in Putnam's article. They also note the elitist stance of the argument, where re- 
sponsibility for the alleged decline of social capital is put squarely on the lei- 
sure behavior of the masses, rather than on the economic and political changes 
wrought by the corporate and governmental establishment. In her trenchant re- 
view of Putnam's thesis, Skocpol (1996, p. 25) also stresses this point: 


How ironic it would be if, after pulling out of locally rooted associations, the 
very business and professional elites who blazed the path toward local civic 
disengagement were now to turn around and successfully argue that the less 
privileged Americans they left behind are the ones who must repair the na- 
tion's social connectedness... 


These critiques are valid but do not address a more fundamental problem 
with Putnam’s argument, namely its logical circularity. As a property of com- 
munities and nations rather than individuals, social capital is simultaneously a 
cause and an effect. It leads to positive outcomes, such as economic develop- 
ment and less crime, and its existence is inferred from the same outcomes. Cit- 
ies that are well governed and moving ahead economically do so because they 
have high social capital; poorer cities lack in this civic virtue. This circularity 
is well illustrated in passages like the following:

Some regions of Italy...have many active community organizations....These
“civic communities” value solidarity, civic participation, and integrity. And
here democracy works. At the other end are “uncivic” regions, like Calabria
and Sicily, aptly characterized by the French term incivisme. The very concept
of citizenship is stunted here. (Putnam 1993, p. 36)


In other words, if your town is “civic,” it does civic things; if it is “uncivic,” it 
does not. 

Tautology in this definition of social capital results from two analytic decisions:
first, starting with the effect (i.e. successful versus unsuccessful cities)
and working retroactively to find out what distinguishes them; second, trying
to explain all of the observed differences. In principle, the exercise of seeking 
to identify post-factum causes of events is legitimate, provided that alternative 
explanations are considered. In fairness to Putnam, he does this in his analysis
of differences between the well-governed towns of the Italian north and the 
poorly governed ones of the south (Putnam 1993, Lemann 1996). Such retro- 
active explanations can only be tentative, however, because the analyst can 
never rule out other potential causes and because these explanations remain 
untested in cases other than those considered. 

More insidious, however, is the search for full explanation of all observed
differences because the quest for this prime determinant often ends up by rela- 
beling the original problem to be explained. This happens as the elimination of 
exceptions reduces the logical space between alleged cause and effect so that the 
final predictive statement is either a truism or circular.⁸ In Putnam's analysis
of Italian cities, such factors as differences in levels of economic development, 
education, or political preferences proved to be imperfect predictors. Thus, the 
search for a prime determinant gradually narrowed to something labeled (fol- 
lowing Machiavelli) vertu civile (civic virtue). It is present in those cities 
whose inhabitants vote, obey the law, and cooperate with each other and whose 
leaders are honest and committed to the public good (Putnam 1993, 1995). 

The theory then goes on to assert that civic virtue is the key factor differenti- 
ating well-governed communities from poorly governed ones. It could hardly 
be otherwise given the definition of the causal variable. Thus, cities where everyone
cooperates in maintaining good government are well governed. To
avoid saying the same thing twice, the analyst of social capital must observe
certain logical cautions: first, separating the definition of the concept, theoretically
and empirically, from its alleged effects; second, establishing some controls
for directionality so that the presence of social capital is demonstrably
prior to the outcomes that it is expected to produce; third, controlling for the
presence of other factors that can account for both social capital and its alleged
effects; fourth, identifying the historical origins of community social capital in
a systematic manner.

⁸The method of analytic induction, popular in American sociology in the 1940s and 1950s,
consisted precisely in this process of seeking to explain all cases and gradually eliminate all
exceptions. It went rapidly out of favor when it was discovered that it basically gave rise to
tautologies by redefining the essential characteristics of the phenomenon to be explained. The only
way of guaranteeing closure or zero exceptions turns out to be an explanation that is a logical
corollary of the effect to be explained. On analytic induction, see Turner (1953) and Robinson

This task is doable, but time-consuming. Instead, the intellectual journey 
that transformed social capital from an individual property into a feature of cit- 
ies and countries tended to disregard these logical criteria. The journey was 
fast, explaining major social outcomes by relabeling them with a novel term 
and then employing the same term to formulate sweeping policy prescriptions. 
While I believe that the greatest theoretical promise of social capital lies at the 
individual level—exemplified by the analyses of Bourdieu and Cole- 
man—there is nothing intrinsically wrong with redefining it as a structural 
property of large aggregates. This conceptual departure requires, however, 
more care and theoretical refinement than that displayed so far.⁹


Conclusion 


Current enthusiasm for the concept reviewed in this article and its proliferating 
applications to different social problems and processes is not likely to abate 
soon. This popularity is partially warranted because the concept calls attention 
to real and important phenomena. However, it is also partially exaggerated for 
two reasons. First, the set of processes encompassed by the concept is not
new and has been studied under other labels in the past. Calling them social
capital is, to a large extent, just a means of presenting them in a more appealing 
conceptual garb. Second, there is little ground to believe that social capital will
provide a ready remedy for major social problems, as promised by its bolder 
proponents. Recent proclamations to that effect merely restate the original 
problems and have not been accompanied so far by any persuasive account of 
how to bring about the desired stocks of public civicness. 

At the individual level, the processes alluded to by the concept cut both 
ways. Social ties can bring about greater control over wayward behavior and
provide privileged access to resources; they can also restrict individual free- 
doms and bar outsiders from gaining access to the same resources through par- 
ticularistic preferences. For this reason, it seems preferable to approach these 


9A promising effort in this direction has been made by Woolcock (1997), who seeks to apply 
the concept of social capital to the analysis of national and community development in Third World 
countries. After an extensive review of the literature, he notes that "definitions of social capital 
should focus primarily on its sources rather than its consequences since long-term benefits, if and 
when they occur, are the result of a combination of different. . types of social relations, combinations 
manifold processes as social facts to be studied in all their complexity, rather
than as examples of a value. A more dispassionate stance will allow analysts to 
consider all facets of the event in question and prevent turning the ensuing lit- 
erature into an unmitigated celebration of community. Communitarian advo- 
cacy is a legitimate political stance; it is not good social science. As a label for 
the positive effects of sociability, social capital has, in my view, a place in the- 
ory and research provided that its different sources and effects are recognized 
and that their downsides are examined with equal attention. 
ABSTRACT


Since the rise of the religious right, scholars have become increasingly inter- 
ested in studying conservative Protestantism. Not only do conservative Prot- 
estants (CPs) make up at least a quarter of the US population; they differ 
from many Americans in gender-role attitudes, childrearing styles, political 
orientation, and other ways as well. In fact, religious factors often predict 
people's political views better than do either class or gender, even though the 
latter two have received far more attention in the scholarly literature (Manza 
& Brooks 1997, Kellstedt et al 1996b). Unfortunately research in this area 
has been hampered by imprecise measurement and poor understanding of the 
various movements grouped together as CPs. This has muddied statistical re- 
sults, stifled theoretical development, and blinded researchers to promising 
areas of analysis. Thus, in this chapter we first discuss the history and distinc-
tive qualities of the various CP movements, then we use these insights to pro-
pose better survey measures, and finally we apply this knowledge to several
substantive areas (i.e., gender-role attitudes, childrearing styles, tolerance,
the “culture wars,” the religious right, and the reasons for the religious vital-
ity of CP groups).


PARAMETERS OF THIS CHAPTER 


Defining conservative Protestantism is difficult because conservative Protes- 
tants (CPs) belong to such a jumble of different denominations and movements, 
and they do not agree on any one label or set of beliefs (Dayton & Johnston 
1991, Marsden 1987a, Kellstedt et al 1996a,c). To add to the confusion, many
social scientists and journalists use the terms “fundamentalist,” “evangelical,” 
“born again,” “conservative Protestant,” and “religious right” indiscriminately
without considering the differing meanings of these terms (Kellstedt & Smidt 
1996). Survey researchers also employ widely varying measurement strategies, 
which can create seemingly contradictory results. Even the label “conserva-
tive” Protestant is problematic. Although CPs are generally conservative on
some theological issues, they are often innovative on others, breaking patterns 
of classical Protestant thought, creating new worship styles, etc. Their resis- 
tance to modernity is highly selective (Oldfield 1996, p. 49, Dayton 1991). 

Moreover, not all CPs are conservative politically. A sizable portion are 
Democrats and economically liberal (Hart 1992, Jelen 1987). If anything, the 
average white CP is more economically liberal than mainline Protestants. This 
is especially true for biblical literalists and members of holiness or Pentecostal 
denominations (though “Baptist-fundamentalists” tend to be more economi-
cally conservative) (Iannaccone 1993, Pyle 1993).1 On social issues CPs are
generally more conservative (as measured by their statistical mean) but often
also have greater diversity in their views than the general public (i.e., a signifi-
cantly larger standard deviation) (Gay et al 1996, Gay & Ellison 1993, DiMag-
gio et al 1996).
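
The distinction between a group's mean and its standard deviation can be made concrete with a small sketch. The scores below are hypothetical values invented purely for illustration (a 1-7 scale on which 7 is the most conservative response); they are not drawn from any of the surveys cited, and the variable names are assumptions of this sketch.

```python
from statistics import mean, stdev

# Hypothetical 1-7 attitude scores (7 = most conservative).
# These numbers are invented for illustration; they are not survey data.
general_public = [3, 4, 4, 4, 5, 4, 3, 5, 4, 4]
cp_sample = [2, 7, 6, 3, 7, 5, 2, 7, 6, 5]


def summarize(scores):
    """Return (mean, sample standard deviation) for a list of scores."""
    return mean(scores), stdev(scores)


public_mean, public_sd = summarize(general_public)
cp_mean, cp_sd = summarize(cp_sample)

# A group can sit further toward one pole on average and still be
# more internally divided than the comparison group.
print(f"general public: mean={public_mean:.2f}, sd={public_sd:.2f}")
print(f"CP sample:      mean={cp_mean:.2f}, sd={cp_sd:.2f}")
```

In this invented example the second group has both the higher mean and the larger spread, which is the pattern the paragraph describes: reporting only the mean would hide the internal disagreement.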

To avoid the problem of confusing religious and political “conservatism,” 
many scholars use the term “evangelical” to describe all CPs. These scholars 
also argue that “evangelical” and “mainline” Protestantism are identifiably his- 
toric traditions, whereas other categorizations have more arbitrary dividing 
lines. However, using the general “evangelical” category has two problems. First, 
not all CPs accept the label, and some actively deny it (Dayton 1991). Although
there may be a loose affinity between the groups categorized as “evangelical,” it 
is not clearly the only or best label. Second, the term evangelical is confusing, 
because scholars simultaneously use it to describe the moderate wing of contem- 
porary CPs, CPs as a whole, and nineteenth century revivalism (often in the 
same text). Thus, those not familiar with the movements have difficulty deter- 
mining when the term refers to moderate CPs and when to all of them. To avoid 
this problem we use “CP” as the general term, and “evangelical” for the mod- 
erate wing of CPs which emerged after World War II. In doing this, we are ad- 
justing the terminology of scholars like Kellstedt et al (1996a); we are not chal- 
lenging their definition of the broad “evangelical” category or their division of 
Protestant denominations into “mainline,” “evangelical,” and “black Protestant”
categories. In the history section we also refer to nineteenth century Protestant 
revivalistic movements as “evangelical.” This matches the historical literature 
and is distinct enough in time to avoid undue confusion. Our terminology cre-
ates problems as well, but we have not discovered any unproblematic terms.2


1 Conservative Protestants may be becoming more economically conservative (see Kellstedt et
al 1996d).

2 Alternative solutions would be to call all CPs “traditional Protestants” or “theologically
conservative Protestants” or to call all CPs “evangelicals” and moderate CPs “neo-evangelicals.” We
do not recommend dividing Protestants into “conservatives,” “moderates,” and “liberals.” Unfortunately,
however, our terminology may create some confusion with this alternative categorization.
This chapter focuses on white CPs. Although black CPs are conservative
theologically, they are very different politically and are separated institution- 
ally from white CPs. Thus, most researchers analyze them as a distinct group. 
For good summaries of research on black CPs, see Lincoln & Mamiya (1990), 
Payne (1995), and Sernett (1991). Scholars have generally ignored Asian- 
American and Latino/a-American CPs. Fortunately, an edited volume that in- 
cludes chapters on these is forthcoming (Warner & Wittner 1998). 

A substantial literature also compares “fundamentalisms” around the world 
[e.g., the five-volume series edited by Marty & Appleby (1991-1995)].3 A 
number of scholars have criticized these works for conflating conservative re- 
ligious movements with postcolonial nationalist religious movements and for 
brushing over the substantial differences between, for example, American 
Protestant fundamentalism and Lebanese Muslim fundamentalism [see Mun- 
son 1995a,b, Billings & Scott 1994, and review symposiums in the Review of 
Religious Research 35(1), (4), 37(4)]. Others contend that as long as proper 
qualifications are made, comparison facilitates new insights (Appleby 1995). 
This chapter focuses exclusively on CPs in the United States. 


HISTORICAL CONTEXT 


Fundamentalists and Evangelicals 


Modern American fundamentalism grew out of the nineteenth century evan- 
gelical movement. In the first half of the century, the revivalistic evangelical 
tradition grew rapidly, outstripping the Congregationalists, old-light Presbyte- 
rians, and Anglicans who had dominated the colonial religious map but did not 
adjust well to the challenges of the expanding frontier (Finke & Stark 1992). 
Northern evangelicals were active social reformers and provided the major im- 
petus behind abolitionism, temperance, and a number of similar social move- 
ments (Carwardine 1993, Marsden 1987a, Smith 1957); they were also early 
advocates of the separation of church and state (McLoughlin 1971). 

Following the Civil War, tensions developed between Northern evangelical 
leaders over Darwinism and higher biblical criticism; Southerners remained 
unified in opposition to both (Marsden 1980, 1991). Modernists attempted to 
update Christianity to match their view of science. They denied biblical mira- 
cles and argued that God manifests himself through the social evolution of so- 
ciety. Conservatives resisted these changes. These latent tensions erupted to 
the surface after World War I in what came to be called the fundamental-
ist/modernist split.


3 Other literature examines the social and political implications of the spread of American CP
movements overseas (Hallum 1996, Walls 1996, Brusco 1995, Cox 1995, Martin 1990). For good
or ill, these movements are profoundly influencing Asia, Africa, and Latin America; yet social
scientists have paid little attention to them (Cox 1995).
Shortly before the war, conservative scholars published a series of mono-
graphs called The Fundamentals of the Christian Religion, which argued for 
the authority of Scripture, the veracity of biblical miracles, and salvation 
through Christ alone. They claimed that these doctrines are so fundamental to 
the Christian faith that those who deny them are outside the Christian tradition. 
As a result they were labeled “fundamentalists.” Many fundamentalists 
viewed the rise of Bolshevism as the natural outcome of modernism, and the 
barbarity of “civilized” Europe during World War I as a resounding disproof of 
modernist beliefs in the perfectibility of society and the goodness of human na- 
ture. Society was not becoming better, it was becoming worse; social reform 
and education could not overcome human sinfulness. 

Modernists had their greatest strength among Northern denominational 
leaders and seminary professors; fundamentalists, among pastors and laity. 
Thus, Northern fundamentalists struggled to wrest control of denominational
hierarchies and seminaries from modernists. But when inclusive moderates 
sided with the modernists for the sake of tolerance, the fundamentalists were 
defeated. Fundamentalists also experienced a humiliating public-opinion de-
feat in the Scopes trial of 1925 and increasingly withdrew from the public spot-
light to build their own separate institutions.4 Some formed separate denomi-
nations; others stayed in existing denominations but developed networks of
parachurch organizations outside denominational control (Marsden 1980, Car- 
penter 1997). 

However, significant tension still existed within fundamentalism about 
how much to "separate from the world." Many fundamentalist leaders were 
embarrassed by the fractious, anti-intellectual image of fundamentalism and 
sought to differentiate themselves from the more extreme elements of the 
movement. They wanted a more open, intellectually engaged version of classi-
cal Protestantism, and therefore they called themselves “neo-evangelicals,” in
reference to the nineteenth century evangelical movement.5 Initially most
neoevangelical institutions were in the North and West, although the move- 
ment spread to the South as well. Eventually the “neo” dropped away, and 
these CPs became “evangelicals” (Marsden 1987a, 1987b).6

Currently, “fundamentalism” properly refers to a small subset of CPs 
(although it is often misused to refer to all CPs). Fundamentalists emphasize a 
strict literal interpretation of the Bible, dispensational theology, premillennial 
eschatology, and institutional separation from “apostasy” (i.e., liberal Protes-
tants and Catholics) (see Weber 1991, Marsden 1987a, 1991, Ammerman


4 See Numbers (1993) for a history of creationism from the mid-1800s to the mid-1980s.

5 The seminal work on fundamentalism between the 1920s and the rise of neoevangelism is
Carpenter (1997).

6 For a list of resources and annotated bibliographies about CPs, see Blumhofer & Carpenter
(1990) and Magnuson & Travis (1990, 1997).
1987). Although fundamentalists have some loose national associations, such
as Jerry Falwell’s Baptist Bible Fellowship, power remains predominantly
with individual pastors.7


Pentecostals and Charismatics 


Conservative Protestantism also contains two other major movements: Pente- 
costalism and the charismatic renewal. Pentecostals emerged around the turn 
of the century from the “Holiness” wing of nineteenth century evangelicalism. 
They emphasized the gifts of the Holy Spirit (especially divine healing, proph-
ecy, and speaking in tongues) and spread primarily among socially marginal-
ized groups (i.e., poor whites, blacks, and immigrants) (Anderson 1987, Old-
field 1996). In some early revivals (e.g., Azusa Street), blacks and whites wor-
shipped together, but each soon formed separate institutions (Sernett 1991).
Like fundamentalists, Pentecostals have traditionally been separatistic (An- 
derson 1987), but fundamentalists and Pentecostals have not gotten along 
well with each other either. One area of contention is the basis of religious 
authority. For fundamentalists, miraculous signs (especially prophecy and 
speaking in tongues) ended when the writing and compilation of the Bible was 
completed; religious authority is based on a “literal” interpretation of these sa-
cred texts.9 Pentecostals, on the other hand, believe that God still continues to
reveal his will through prophets. To fundamentalists this seems to challenge 
the final authority of Scripture. Fundamentalists emphasized doctrine; Pente- 
costals, experience (also see Wilcox 1996, pp. 28-30). 

These antipathies have had important implications for recent political mo- 
bilizations. For example, even though Jerry Falwell (a fundamentalist) and Pat 
Robertson (a charismatic/Pentecostal) are both Southern, Republican, theo- 
logically conservative, and politically conservative, both have had difficulty 
gaining support among the others' constituencies (Oldfield 1996, Wilcox 
1996, Green 1996). Traditionally neoevangelicals have distanced themselves 
from both fundamentalists and Pentecostals because of disagreements about 
how much Christians must separate themselves from the world (although class 
and regional differences probably also play a role). Most evangelicals also 
tend to feel uncomfortable with the more exuberant aspects of Pentecostal 
worship and with claims of prophecy, healing, etc. Politically, Pentecostals are


7 Fundamentalists are further divided over the strictness of separation; strict separatists are a
very small group, and less separatist groups are larger (see Marsden 1991).

8 Some Pentecostals have become more ecumenical and joined the National Association of
Evangelicals (e.g., the Assemblies of God).

9 Fundamentalists tend to be “dispensationalists.” According to this theory, God interacts with
his people in different ways during different phases of history (i.e., different dispensations).
Miracles were necessary in biblical times to establish the authority of the apostles and prophets who
wrote the Bible, but in the current dispensation (“the age of the church”), they are not
the most conservative on abortion and other social issues, the most liberal on
welfare spending, and statistically the least likely to vote, even when socioeco- 
nomic status is controlled (Kellstedt et al 1996c, Smidt et al 1996, Pyle 1993). 

In the 1960s a fourth movement developed within conservative Protestant- 
ism—the charismatic renewal. Like Pentecostalism, the charismatic move- 
ment emphasized speaking in tongues and miraculous healing. But unlike Pen- 
tecostalism, it spread among Catholics and members of mainline and evangeli- 
cal denominations. Charismatics were also more middle-class and Northern 
(Wilson 1984) and probably as a result were less separatistic (Anderson 1987). 
Thus, unlike Pentecostals, charismatics generally stayed within existing de- 
nominations and became a bridge among evangelicals, Pentecostals, mainline 
Protestants, and Catholics. Pentecostals have increased in socioeconomic 
status to levels similar to those of charismatics and share many beliefs with 
them, but still most charismatics do not identify as Pentecostals and vice versa 
(Smidt et al 1996).10

Charismatics and Pentecostals comprise approximately 12% of the US 
population11 and have had a profound impact on African, Latin American, and
Asian Christianity. They are the fastest growing segment of Christianity 
worldwide, and yet social scientists have paid little attention to them (Smidt et 
al 1996, Anderson 1987). Most research surveys don't ask even a single ques- 
tion that would identify them (e.g., questions about speaking in tongues, or re- 
ligious movement identity, or detailed denominational categories). Thus, most 
scholars and journalists did not realize that in the 1988 presidential campaign 
Pat Robertson's political support was limited almost exclusively to the “Spirit-
filled” (Green 1996, Oldfield 1996, Smidt & Penning 1990).12

Although these distinctions within conservative Protestantism may not 
seem important to many contemporary social scientists, they are important to
adherents, and they have important social and political implications (Smith et 
al 1998, Green et al 1996b, Kirkpatrick 1993). Moreover, in the United States, 
each of these four groups is probably as large as all non-Christian religious 
groups combined and none shows any sign of declining. Thus, presumably 
they deserve a similar amount of scholarly attention. 


Regional Variation 


There are also important regional variations among CPs. Southern CPs have
their own unique religious style and distinct networks of institutions. They also 


10 The major theological distinction between the two is that Pentecostals believe that those who
do not speak in tongues have not been baptized by the Holy Spirit; charismatics disagree.

11 Many of the estimates in this paper are based on phone polls. Phone polls oversample regular
church attenders and thus may oversample some religious groups as well (Woodberry 1998).

12 For annotated bibliographies and resource lists on charismatics and Pentecostals see Mills
1985, Dayton 1985, Burgess & McGee 1988, and Jones 1983, 1995.
tend to be more separatistic and socially conservative than Northern CPs (Shi-
bley 1996, Ammerman 1990, Carpenter 1984). This is the result of the South's 
distinct historical experience. 

During the colonial period, evangelicalism was weak in the South. Most 
white Southerners were Anglican or nonpracticing, and they viewed evangeli- 
cals with suspicion. Initially, Northern evangelical missionaries to the South 
openly condemned slavery, had mixed-race meetings, and emphasized more 
egalitarian gender roles. The Methodists even attempted to bar all slave hold- 
ers from membership. However, these practices caused such opposition that 
evangelicals moderated them to avoid alienating potential followers (Heyrman 
1997). As the movement became indigenous, Southern evangelicals increas- 
ingly defended slavery based on a literal interpretation of scripture. Southern 
evangelicals also resisted Northern evangelical social reform movements and 
perfectionist theology because they were associated with abolitionism (Car- 
wardine 1993, Smith 1957). As the Civil War approached, most evangelical 
denominations and mission boards split over slavery (Northerners dominated 
national-level organizations because of their larger population), and a number 
of separatist religious movements swept through the South that rejected church 
organization beyond the local level as unbiblical. These movements had a last- 
ing impact on Southern religion (Ammerman 1990, pp. 33-34, 343). 

The toll of the Civil War on the South was enormous and left deep resent- 
ments against Northern cultural impositions. In the cultural crisis that followed 
defeat, Southerners struggled to regain the honor and identity they had lost. 
Many believed that their unfaithfulness to God’s law had caused their defeat; 
for God to bless them again, they needed to uphold his law more diligently than 
ever before. The result was a series of revivals that swept through the South. 
Southerners emphasized that they were not morally inferior; on the contrary, it 
was the Northerners who had become increasingly involved in secular social 
reform, while the South upheld the true faith (Hill 1980, p. 106, Ammerman 
1990, pp. 38-39).13 Southern denominations perpetuated distinctions from
their Northern counterparts and successfully lobbied Southern states to block 
authorization of Northern home missionaries in the South (Ammerman 1990, 
pp. 37-38). Thus, many denominations remained organizationally discon- 
nected from the fundamentalist/modernist split and the later neoevangelical 
movement. Meanwhile, the North was becoming increasingly diverse, while 
the South stayed more religiously homogeneous and CPs maintained a do-
minant position in society (Hill 1980). Because of this homogeneity, few
Southerners objected to prayer or Bible reading in public schools (Shibley 


13 Not only did these experiences influence Southern symbols and schema, they influenced
Northern ones as well. Even today Northerners often associate Southern religion with racism and
backwardness.
1996, Ammerman 1990, p. 55); laws against such practices easily appeared as
further examples of Northern cultural domination. 

As Southerners migrated to the North and West in search of opportunity, 
they often felt uncomfortable with Northern churches and set up churches 
similar to those back home (Shibley 1996, Ammerman 1990).14 These migra-
tions occurred in several waves (i.e., after the Civil War, during the Great De-
pression, and after World War II). In the 1930s, impoverished Dust Bowl
farmers migrated from Arkansas, Oklahoma, and Texas to the agricultural re- 
gions of California and the Pacific Northwest (Shibley 1996, p. 34). Today 
nearly one eighth of Californians, and nearly half of those who reside in the 
San Joaquin Valley, can trace their ancestry to Oklahoma (Haslam 1989). 
Other Southerners migrated to the factories of Ohio, Indiana, and southern 
Michigan. Many Pentecostal, fundamentalist, and Southern Baptist churches 
in the North and West were planted as a result of these migrations. By 1990 
there were over 7300 Southern Baptist churches outside the South with over 
2.8 million members (Shibley 1996, Ammerman 1990, pp. 50-51).15 Al- 
though many of the members of these churches do not have Southern ancestry, 
these churches remain in contact with their roots through the seminary training 
of pastors, networks of relationships, denominational bulletin inserts, Sunday 
school material, etc. These networks facilitate the flow of new theological, so- 
cial, and political ideas between the North and the South. 

In spite of the differences described by historians and ethnographers, few 
social scientists have analyzed differences between Northern and Southern 
CPs. Strangely, even students of politics in the South have largely neglected 
religious factors (Green et al 1988a). Yet many of the political leaders of the 
religious right (RR) are Southerners, and some of the issues associated with the 
movement seem strongly influenced by regional identity (Woodberry et al 
1996).16 Moreover, the massive movement of Southerners from the Demo- 
cratic to Republican party may be central to the rise of the New Right. 

There is important variation among Southern CPs as well (e.g., Jimmy Car- 
ter, Billy Graham, and Al Gore are all Southern CPs), and the RR has gained 


14 The reverse has also happened more recently with Northerners moving to the South.

15 Dozens of Southern CPs have also become famous as televangelists, e.g., Oral Roberts, Jimmy
Swaggart, Jim Bakker, Pat Robertson, and Jerry Falwell (Shibley 1996, Oldfield 1996, p. 135).

16 Jerry Falwell, Pat Robertson, Oliver North, Donald Wildmon, Ralph Reed, James Kennedy,
and Jesse Helms are all Southerners; many RR organizations are/were headquartered in the South
(e.g., the Moral Majority, the Christian Coalition, the American Family Association, World
Magazine, the Christian Action Council, Operation Rescue), and many of the most conservative
leaders of the Republican revolution in Congress are also from the South (e.g., Newt Gingrich,
Trent Lott, Dick Armey, Strom Thurmond, Tom DeLay, Jesse Helms), although many are not part
of the RR (Nixon 1996). The areas of California, Oregon, Washington, Michigan, and Ohio with
the strongest RR activism seem to coincide with areas of high Southern migration, although
establishing this connection would require further research.
support among non-Southerners. However, it seems plausible that the histori-
cal Southern experience of defeat and ridicule could have shaped the political
beliefs of Southern CPs differently than Northern CPs, just as the African- 
American experience of oppression has shaped the political beliefs of black 
CPs differently than white CPs. Some (although not all) of the issues associ- 
ated with the RR seem to be influenced by this experience. The concerns of the 
RR cannot be reduced to class and regional conflicts, but they may be accentu-
ated by them.17


MEASUREMENT 


Contemporary sociological research about CPs is generally not as developed 
as historical research. Few social scientists or journalists are religiously ac-
tive, especially in CP denominations (Wuthnow 1985, Lichter & Rothman
1981, Thalheimer 1973), and perhaps because of this lack of familiarity, re-
ligious factors are frequently ignored or measured poorly (Kellstedt et al
1996a, Woodberry 1997, Hart 1996, C Smith 1996, Larson et al 1994, Thomas 
& Cornwall 1990, Wuthnow & Hodgkinson 1990, Gorsuch 1988, Hood 1983). 
Further, distinguishing fundamentalists, evangelicals, charismatics, and 
Pentecostals is complex. Researchers can divide denominations into mainline, 
CP, and black Protestant denominational families. They can also distinguish 
Pentecostal denominations. However, they cannot easily identify fundamen- 
talists, evangelicals, and charismatics in this way. Some denominations are 
clearly part of each of these movements, but these movements cross-cut de- 
nominations and even denominational families. They are better understood as 
loosely connected networks of ministerial associations, parachurch organiza- 
tions, schools, seminaries, magazines, etc. Only recently have scholars begun 
developing measures to differentiate these groups (see Green et al 1996b,
chapters 10-13); most research focuses only on identifying CPs as a whole.
The three major approaches to identifying CPs are denomination, beliefs, 
and self-identification. Although these approaches overlap, they have impor- 
tant distinctions as well. These differences cause many of the apparent contra- 
dictions in the literature. Indeed, whether “fundamentalists” are less educated 
than the general population depends on whether they are identified by denomi- 
nation or doctrine or are self-identified (Kellstedt & Smidt 1996). Without 
multiple measures these distinctions are impossible to identify. However, this 


17 Unfortunately Southern identity is usually measured poorly on surveys. Given the mobility
between the North and South, the differences between black and white Southerners, and the great 
variation in the strength of people’s regional identity, merely controlling for state of residence does 
not seem sufficient. 
type of detail is extremely rare. On most surveys, it is fortunate if there is even
one good measure. 

Another weakness is that scholars do not generally connect qualitative in- 
terviews with quantitative survey data. Thus, scholars have difficulty deter- 
mining how representative case studies, "insider documents," or interviews 
are, and they have difficulty deciphering the rich contextual meanings of iso- 
lated survey responses. This increases the danger that scholars will either over- 
generalize the views of small, unrepresentative groups and leaders or project 
alien meanings on respondents' statements and practices. For example, some 
of CPs' positions on government intervention seem contradictory but make 
more sense once scholars understand the moral metaphors that motivate them
(Lakoff 1996).


Denominational Affiliation


The most commonly available measure of CPs is denominational affiliation. 
Unfortunately it is seldom measured with sufficient care or precision. For 
example, Northern Baptists (i.e., American Baptists in the USA and General 
Conference Baptists) are more moderate than Southern Baptists, Missionary 
Baptists, and Independent Baptists. Missouri Synod and Wisconsin Synod Lu-
therans are more conservative than the Evangelical Lutheran Church in Amer-
ica.19 The generic categories Baptist, Lutheran, Presbyterian, etc., mask these
distinctions and muddy statistical results. Without detailed denominational 
categories and specific follow-up questions, valuable information is lost and 
scholars have difficulty recoding respondents into CP, mainline, and black 
Protestants (let alone into Pentecostal or Northern- and Southern-based de- 
nominations). When respondents say they are “Lutheran” or “Presbyterian,” 
researchers need to ask, “Which kind?" 
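
The follow-up logic described above can be sketched as a small recoding routine. This is a minimal illustration only: the function name, the category labels, and the specific assignments in the lookup table are assumptions made for this sketch, not a published coding scheme, and a real survey would need a far longer denominational list.

```python
# Illustrative lookup of specific denominations. The category assignments
# are assumptions for this sketch, not an authoritative classification.
SPECIFIC_DENOMINATIONS = {
    "southern baptist convention": "CP",
    "lutheran church-missouri synod": "CP",
    "wisconsin evangelical lutheran synod": "CP",
    "evangelical lutheran church in america": "mainline",
}

# Generic family names that mask internal variation and so need a follow-up.
GENERIC_FAMILIES = {"baptist", "lutheran", "presbyterian"}

FOLLOW_UP = "ask follow-up: which kind?"


def recode(answer):
    """Map a respondent's stated denomination to a category or a follow-up prompt."""
    key = answer.strip().lower()
    if key in SPECIFIC_DENOMINATIONS:
        return SPECIFIC_DENOMINATIONS[key]
    if key in GENERIC_FAMILIES:
        # A bare family name cannot be placed; the interviewer must probe.
        return FOLLOW_UP
    return "uncoded"


for answer in ["Lutheran", "Evangelical Lutheran Church in America", "Southern Baptist Convention"]:
    print(answer, "->", recode(answer))
```

The point of the sketch is the middle branch: a bare family name such as "Lutheran" is treated as missing information that triggers a probe, rather than being forced into a single category.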

The next problem is separating this jumble of denominations into useful 
categories. Perhaps the most commonly used categorization is that of Roof &
McKinney (1987). However, their denominational categories are broad and in- 
complete and can cause significant measurement error. Major denominations 
like the Nazarenes are not even listed. One advantage of their approach, how- 
ever, is that black Protestant denominations are separated from white ones. 

The General Social Survey (GSS) coding is more complete but also has 
problems (especially before 1983). It recodes all denominations into the cate- 
gories “fundamentalist,” “moderate,” and “liberal” (see T Smith 1990). This 
applies the category “fundamentalist” to over 30% of the US population, a la- 
bel most would actively deny. This also assumes that all respondents can be 


18 Indeed, most surveys record only whether a respondent is “Protestant,” “Catholic,” “Jewish,”
or “Other.” These categories are too broad to be of much use (Demerath & Roof 1976, Kellstedt et
al 1996a).

19 In this case “Evangelical” refers to the Protestant Reformation in Germany.
categorized along a single continuum based on the fundamentalist/modernist
split of the 1920s, which is questionable even among Protestants, let alone 
Buddhists and Hindus (Kellstedt et al 1996a). The GSS codes all Catholics as 
“moderates” and all nonaffiliated as “liberal.” Political views also seem to in- 
fluence how some denominations and religious groups are coded. For example, 
many black CP denominations are coded as “moderates,” although theologically 
they probably belong in the “fundamentalist” category, and all Jews are coded 
as “liberal,” even those who are theologically conservative (Regnerus et al 
1997).20 The GSS also lumps “nondenominational” Protestants in the same
category as those with no denomination. This is unfortunate because the growth 
of nondenominational Protestants is one of the most important and least docu- 
mented religious changes taking place in the United States. The overwhelming 
majority are CPs and do not belong in the “moderate” or “liberal” category. 
Those who claim “no denomination” do not resemble them either socially or 
politically (Smidt et al 1996, Shibley 1996). 

Fortunately, political scientists have developed some excellent tools to dis- 
tinguish CP denominations. Kellstedt et al (1996a) is probably the most thor- 
ough published version, but similar categorizations are available on the 
American National Election Studies 1996 cumulative file, the fall 1996 Southern
Focus Poll, and several Pew data sets.21 Interviews with those who conducted
these surveys suggest that detailed denomination questions take little
survey time and create little respondent or interviewer burden. Developing 
CATI skip patterns and recoding denominations into useful categories are 
complicated, but these can be copied from existing surveys. These detailed 
categorizations also allow scholars to distinguish Pentecostals, black Protes- 
tants, and Northern- and Southern-based denominations. 


Beliefs 


Although many surveys have questions about religious belief, few of these are 
useful for distinguishing CPs. “Biblical literalism” or “biblical inerrancy” are 
the most common beliefs used, but these are usually measured poorly. First, 
the conservative response categories are generally too broad. About 35% of 
the US population claim the Bible should be interpreted “literally,” and many 
more say it is true but should not always be interpreted literally (both of these 
are theologically appropriate responses for CPs). However, there are major 
differences in what people mean by “literal” or “without error” (Bartkowski 
1996, Shibley 1996). The limited response options mask these differences. 


20Controlling for race does not solve the problems caused by the lack of a black Protestant 
category. 

21Melton (1989) describes the origins and beliefs of almost every religious group in the United
States. It is helpful for more detailed categorizations of denominations. Many small denominations
are not coded on most surveys, including the GSS.
Second, many better-educated evangelicals are not literalists or inerrantists.
They prefer to say the Bible is "trustworthy for all matters of faith and prac- 
tice.” As a result, biblical literalism is more strongly correlated with education 
than either CP movement identification or denominational affiliation 
(Kellstedt & Smidt 1996). Thus, when scholars use these measures to identify 
CPs, they influence CPs' demographics and attitudes. 

Moreover, using a single belief to categorize a complex group like CPs 
causes substantial measurement error; thus coefficients are no longer BLUE
(best linear unbiased estimates; i.e., they are no longer unbiased and consistent) (Bollen 1989). When possible, it is better to
measure conservative Protestantism as a latent variable using several beliefs. 
Generally CPs emphasize a personal relationship with Jesus Christ, believe in 
the importance of converting others to their faith, have a strong view of biblical 
authority, and believe that salvation is through Christ alone.22 Unfortunately,
few surveys have such detailed belief questions. Almost none ask respondents 
whether they think it is important to convert others, whether they speak in 
tongues, or about their views on Jesus Christ, biblical prophecy, or separation 
from the world. Yet historians use many of these exact measures to distinguish 
different types of CPs (see e.g., Marsden 1987a). This makes connecting his- 
torical and social science literature difficult. Moreover, many of these beliefs 
predict people's behavior better than the more typical questions about whether 
they believe in God or the afterlife (about 70—90--95 say they do—too many to 
be very useful), and certainly more than the many detailed questions the GSS 
asks every year on how people view God or heaven (e.g., “Do you see God as 
more a savior or redeemer?” “More as a creator or healer?"). Even when sur- 
veys have multiple indicators, most researchers use them separately or addi- 
tively (e.g., CPs are biblical literalists, who are “born again" and have shared 
their faith with others). This also causes measurement problems because, for 
example, CPs may be biblical literalists and "born again," but too shy to share 
their faith. Measuring conservative Protestantism as a latent variable avoids 
these problems. Still, these problems are minor in comparison to the typical single
measure approach.
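
The attenuation argument above can be illustrated with a small simulation. This is only a sketch under stated assumptions: it is not a structural equation model, and averaging several noisy belief indicators is used here as a simple stand-in for a latent-variable score. The function names (`ols_slope`, `simulate`), the number of indicators, and the noise levels are hypothetical choices made for illustration; none of them come from the surveys discussed in the text.

```python
import random
import statistics


def ols_slope(x, y):
    """Slope from a bivariate least-squares regression of y on x."""
    mean_x, mean_y = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx


def simulate(n=20000, true_slope=1.0, n_indicators=4, noise_sd=1.0, seed=0):
    """Compare a single noisy indicator with an average of several.

    Assumption: each indicator equals the unobserved trait plus independent
    random error (classical measurement error).
    """
    rng = random.Random(seed)
    latent = [rng.gauss(0, 1) for _ in range(n)]            # unobserved trait
    outcome = [true_slope * t + rng.gauss(0, 1) for t in latent]
    indicators = [
        [t + rng.gauss(0, noise_sd) for t in latent]        # noisy survey items
        for _ in range(n_indicators)
    ]
    single = indicators[0]
    composite = [statistics.fmean(items) for items in zip(*indicators)]
    return ols_slope(single, outcome), ols_slope(composite, outcome)


if __name__ == "__main__":
    single_slope, composite_slope = simulate()
    print(f"slope using one indicator:    {single_slope:.2f}")
    print(f"slope using four indicators:  {composite_slope:.2f}")
```

Under these assumptions the single-indicator slope should fall well below the true slope of 1.0 (roughly half of it when trait and error variances are equal), while the composite recovers more of it. A full latent-variable model would go further by estimating the error variances rather than averaging them away.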


Movement Identification 


Until recently, few surveys asked respondents whether they identify with particular
religious movements (e.g., fundamentalism, evangelicalism).23 Yet


22For a more detailed discussion of identifying CPs with belief measures, see Green et al 1996b,
pp. 10–13. They also discuss ways to identify subgroups within conservative Protestantism.
23Some surveys ask if respondents are “born again," but this has several disadvantages. First, it
provides less detailed information than a movement ID question; second, the number of people who
are "born again” varies significantly based on context and definition; and third, many avoid the
label because they cannot pinpoint the time of their conversion or because of negative connotations
(see Schumm & Silliman 1990).
movement ID generally predicts people’s attitudes and political behavior better
than denomination or generic religious beliefs (Smith et al 1998, appendix
B, Kellstedt et al 1996c). Among Protestants, we think the most useful catego- 
ries are “theologically liberal,” “mainline,” “Pentecostal,” “charismatic,” 
“evangelical,” and “fundamentalist.”24 Surprisingly, given the widespread use 
of the term “mainline-liberal,” almost no one chooses both of those self-IDs. 
Few people also choose both the charismatic and Pentecostal IDs (also see 
Smidt et al 1996). 

Despite the predictive power of movement IDs, there are some potential
problems with using them. First, people can be part of a religious movement 
without knowing it. Second, some people have multiple IDs. Third, the terms 
may mean different things to different people. For example, some may 
choose the label “fundamentalist” because they believe the fundamentals (e.g., 
the virgin birth of Christ, the authority of the Bible) and not because they are 
separatistic, premillennial dispensationalists, or connected to ministers like
Jerry Falwell, Bob Jones, or Jack Hyles. Some politically liberal CPs may also 
avoid using the term “evangelical” because of its association with fundamen- 
talism and the RR (see Kellstedt et al 1996c, p. 259, Johnston 1991). Even 
those who identify themselves as evangelicals do not always feel close to 
them, presumably for these same reasons. Fourth, the meaning of the terms is 
gradually changing as people try to associate themselves with particular 
groups (or distance themselves from them). Since the late 1970s, journalists 
and scholars have applied the term “fundamentalist” to Muslim religious 
movements, some of which use terrorist tactics. In the 1990s, Ralph Reed and 
the Christian Coalition began to use the term “evangelical” (Jerry Falwell was 
a fundamentalist and Pat Robertson a Southern charismatic/Pentecostal). 
Journalists covering the RR use a variety of terms, often indiscriminately. In
time, these public uses of the terms may change who is willing to use them. 

However, despite these possible problems, our analysis suggests that IDs 
work relatively well. As part of the Religious Identity and Influence Survey, 
we selected a sample of Protestant respondents from the phone survey and con- 
ducted qualitative face-to-face interviews with them. Their explanations of 
what they meant by “evangelical,” etc., generally matched our expectations 
(although many “fundamentalists” are not as separatistic as expected). For all 
measures, this type of checking is important because respondents often do not 
mean what researchers assume. 


24“Mainline” is probably not a religious movement per se, but it allows respondents to avoid
choosing one of the other movement labels.
Religiosity

Although some surveys have multiple measures of religiosity, few researchers 
use more than one. They generally do not analyze the multiple dimensions of 
religiosity or combine identifiers to measure latent religiosity variables. Many 
also analyze religiosity without analyzing religious affiliation or any other as- 
pects of religion (Thomas & Cornwall 1990). This assumes that religiosity is 
generic, that it does not matter what people believe or what the social context 
of their worship is. However, sometimes denomination strongly influences the 
impact of church attendance. Why people attend is also important. For exam- 
ple, those who attend for personal or religious reasons tend to be less preju- 
diced against ethnic minorities; those who attend for social reasons are more 
prejudiced (Gorsuch 1988). However, when researchers put multiple measures 
of religious involvement in the same regression, they must carefully decipher 
what the individual religion variables represent. For example, it is not clear 
what conservative Protestantism net of the impact of religiosity is. 


Using Multiple Measures 


Green et al (1996b, 1997) have found even stronger results by combining all the 
above measures. For example, those who attend a CP denomination, identify 
with a CP movement, have conservative religious beliefs, and have higher re- 
ligiosity have more conservative abortion attitudes and are more Republican 
than those who are missing some of these CP identifiers. However, this type of 
analysis works better with crosstabs and multiple classification analysis than 
with regression. In structural equation modeling, these multiple measures 
could be used to identify an underlying latent variable or variables. 
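
The cross-tabulation approach described above can be sketched as follows. This is a minimal illustration, not the procedure used by Green et al: the identifier names, the outcome variable, and the respondent records are all hypothetical, and a real analysis would use survey weights and many more cases.

```python
from collections import defaultdict

# Hypothetical yes/no flags for the four kinds of CP identifiers in the text.
IDENTIFIERS = (
    "cp_denomination",
    "cp_movement_id",
    "conservative_beliefs",
    "high_religiosity",
)


def identifier_count(respondent):
    """Number of CP identifiers a respondent has (missing flags count as no)."""
    return sum(1 for key in IDENTIFIERS if respondent.get(key))


def crosstab_mean(respondents, outcome):
    """Mean of `outcome` within each identifier-count category."""
    totals = defaultdict(lambda: [0.0, 0])   # count -> [sum of outcome, n]
    for resp in respondents:
        cell = totals[identifier_count(resp)]
        cell[0] += resp[outcome]
        cell[1] += 1
    return {count: s / n for count, (s, n) in sorted(totals.items())}


if __name__ == "__main__":
    # Invented records; "score" is a made-up attitude scale.
    sample = [
        {"cp_denomination": True, "cp_movement_id": True,
         "conservative_beliefs": True, "high_religiosity": True, "score": 8},
        {"cp_denomination": True, "score": 3},
        {"score": 1},
    ]
    print(crosstab_mean(sample, "score"))
```

Grouping by the number of identifiers keeps the comparison readable in a table, which is one reason this kind of analysis tends to suit crosstabs better than a regression with many correlated religion variables.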


SUBSTANTIVE AREAS 


Gender and Family 


Evangelical and Pentecostal groups were the first to ordain women, and some 
Pentecostal denominations still ordain higher proportions of women than any 
other denominations (Dayton 1991, Chaves 1996, Lindley 1996). Still, most CP 
groups have resisted this and limit the participation of women in church leadership.
Lobbying by certain CP groups also effectively prevented the bestselling
New International Version of the Bible from changing its translation to use 
gender-inclusive terms. Moreover, CPs are generally more supportive of tradi- 
tional roles for women (Gay et al 1996), and a higher proportion of them are full- 
time homemakers. A vast literature has developed about this. Yet little of it 
contemplates why, in spite of this, women are more active in CP churches than 
men (Pevey et al 1996) and also more active in missions and evangelism for 
these groups (Hutchison 1987, Neill 1986, Ryan 1978; also see Brusco 1995). 
Recently, several scholars have begun to examine the complexity in prescribed
women's roles among CPs (Bartkowski 1996, 1997, Ingersoll 1995)
and to interview CP women about their experience and acceptance of these 
roles. These studies show that CP attitudes and practices in the area of gender 
and family life (e.g., household division of labor, female employment, marital 
decision making) are more nuanced and negotiated than previously recognized 
(Brasher 1997, Ellison & Bartkowski 1997, Gallagher 1996, Pevey et al 1996, 
Ozorak 1996, Demmitt 1992, Rose 1987, McNamara 1984) and that there is 
significantly more disagreement about gender-role attitudes among CPs than 
in the general population (i.e., their standard deviation is significantly larger) 
(Gay et al 1996). This complexity is often missed in survey analysis because of 
distinct subcultural usages of terms like “headship” (Gallagher 1996, Pevey et 
al 1996). 

Many scholars also theorize that CPs are more patriarchal and therefore 
more likely to physically or verbally abuse their wives (e.g., Straus et al 1988, 
pp. 7-8, 21). However, both self-report and spousal-report suggest that CP men 
are not more likely than other religious groups to physically abuse their wives 
and are significantly less likely than the religiously nonaffiliated (Ellison et al 
1996, Brinkerhoff et al 1992; also see Straus et al 1988, pp. 128, 138).25 Nei- 
ther are they more likely to use verbal or symbolic aggression (Brinkerhoff et 
al 1992). Moreover, most available evidence suggests that regular church at- 
tenders are significantly less likely to abuse their spouse (Ellison et al 1996, 
Ferguson et al 1986), although in one Canadian study the relationship is curvilinear,
with irregular attenders being the most abusive (Brinkerhoff et al 1992).

Although few scholars have analyzed CP family involvement (Wilcox 
1997), research does show that “biblical literalists" are more likely to value 
obedience in children and to use corporal punishment (Ellison & Sherkat 
1993a,b, Ellison 1996).26 Some researchers suggest that as a result, CPs may 
legitimate and encourage child abuse (Maurer 1982, Capps 1992). However, 
Ellison (1996) argues that the available evidence does not demonstrate this. 
According to Ellison, most studies on the negative impact of corporal punish- 
ment are seriously flawed, especially as it applies to conservative Protestants. 
These studies combine spanking, beating, threats, and assault with weapons, 
and they do not control for attenuating factors like parental involvement, affec- 
tion, and communication. Definitive conclusions are difficult because almost 
no research analyzes the impact of mild-to-moderate corporal punishment. 
Generally, CP authors carefully delimit how and when corporal punishment 


25The exceptions to this rule are the relatively few cases in which the husband attends church
far more or is far more theologically conservative than his wife. In these cases husbands are more
likely to use violence (Ellison et al 1996). Studies disagree about whether CP women are more or
less likely to use violence against their husbands (Brinkerhoff et al 1992, Ellison et al 1996).
26They are not less likely to value intellectual autonomy (Ellison & Sherkat 1993b).
can be used (i.e., only for willful disobedience by pre-teens, not when parents
are angry or feel loss of control, and only when the reason for the punishment is 
carefully explained and followed by “a period of loving intimacy") (Ellison & 
Sherkat 1993a, Ellison 1996, Bartkowski 1996). However, ordinary people 
may not necessarily follow these injunctions. Still, recent research on father- 
hood by W Bradford Wilcox (1997) suggests that CP fathers are more likely to 
hug and praise their children, less likely to yell at them, and are among the 
most involved in their children's daily lives.27 This may or may not attenuate
the possible negative consequences of corporal punishment. 

Thus, CP women may be willing to accept more traditional roles in order to 
get men to be more committed to their marriage and child-raising. Qualitative 
research suggests that CP women use biblical injunctions to hold their husbands 
to higher standards of emotional intimacy and support (Gallagher 1996, Pevey et 
al 1996). In any case, both regular church attenders and CPs report greater life 
satisfaction and greater commitment to and dependence on their marriages 
(Larson & Goltz 1989, Wilson & Musick 1996, Ellison 1991, Ellison et al 1989).
The more religiously active also have greater marital stability and report higher 
levels of marital satisfaction (Call & Heaton 1997, Thomas & Cornwall 1990, 
Lehrer & Chiswick 1993, Filsinger & Wilson 1984, Hunt & King 1978). Studies 
disagree over whether CPs have higher or lower marital satisfaction and stabil- 
ity than other religious respondents (Call & Heaton 1997, Schumm et al 1989, 
Wilson & Filsinger 1986, Chi & Houseknecht 1985, Hunt & King 1978). 
Much of the difference seems to depend on how conservative Protestantism is 
measured, what is controlled for, and to whom CPs are compared. 


Tolerance 


Much research suggests that CPs have selective tendencies toward greater in- 
tolerance than do other Americans (e.g., Gay & Ellison 1993, Wilcox & Jelen 
1990). CPs are not more anti-Semitic than other Americans. Two studies com- 
missioned by the American Jewish Committee show CPs are stronger support- 
ers of Israel, feel closer to Jews, are more likely to view Jews as God’s chosen 
people, and have high social and political acceptance of Jews (e.g., 96% would 
vote for a Jewish president). Their major “intolerant” attitude is that they gener- 
ally believe Jews must accept Jesus to be saved (although they believe this for all 
non-Christians, not just Jews) (T Smith 1994, 1996, Whalen 1996, Guth et al 
1996a,c, Abrams 1997). CPs are also not more racist than other Americans.28


27However, religiosity is not controlled in this analysis; thus greater religiosity rather than
CPism may be the important factor.
28Some extremist-right groups use Christian rhetoric. However, these have little support or
recognition among CPs. For example, of politically/socially conservative white CPs who know
David Duke, only 5% "admire" or “somewhat admire" him, compared to 6% of the rest of the white
population (T Smith 1994, 1996).
They differ little from other Americans on attitudes toward blacks, Asians,
Catholics, Hispanics, or immigrants (T Smith 1996, Davis & Robinson 1996, 
also see Guth et al 1996c, pp. 308–9).29

Conservative Protestants do hold less tolerant views of militarists, racists, 
atheists, homosexuals, feminists, and Moslems (Wilcox & Jelen 1990, Tamney
& Johnson 1997, T Smith 1996). They are also more likely to think "dangerous"
books should not be in the public libraries. These kinds of findings are
reflected in most tolerance scales. 

However, some scholars argue that much of this tolerance research is 
flawed (e.g., Sullivan et al 1982, Smidt & Penning 1982, Davis & Robinson 
1996, Jelen & Wilcox 1990, Hood 1983). First, some studies do not have ade- 
quate controls. Second, they have no measures of behavior. Some "tolerance" 
items may measure attitudinal “moral relativism” rather than actual behavioral 
graciousness toward those with whom one disagrees. Third, most studies treat 
tolerance as a single unidimensional attitude that people either possess or do 
not, despite the fact that different groups may be tolerant to some target groups 
but not others. Fourth, Smidt & Penning (1982) argue that most tolerance 
scales (e.g., the GSS tolerance scale) are biased against CPs (also see Sullivan 
et al 1982, pp. 137, 249–50).30 A better method of measuring tolerance is to let
respondents choose the group they dislike most. This methodology eliminates 
the difference in tolerance between Protestants, Catholics, and Jews and 
greatly reduces the difference between liberals and conservatives (Sullivan et 
al 1982, pp. 135-39). Unfortunately it is difficult to determine what happens to 
CPs' tolerance level because the measures of religion in Sullivan et al are so 
crude. 
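
The difference between a fixed-target scale and the least-liked-group approach can be made concrete with a toy example. The sketch below is a deliberate simplification: it assumes each respondent rates how much they dislike a few groups and gives one yes/no answer about allowing each group's public activity. The group list, field names, and the two respondents are invented for illustration and are not taken from the GSS or from Sullivan et al.

```python
# Hypothetical fixed list of target groups, all from one side of the spectrum.
FIXED_TARGETS = ("atheists", "communists")


def fixed_target_score(respondent, targets=FIXED_TARGETS):
    """Share of the researcher-chosen target groups the respondent would allow."""
    allowed = sum(1 for group in targets if respondent["allows"][group])
    return allowed / len(targets)


def least_liked_score(respondent):
    """1.0 if the respondent would allow their own most-disliked group, else 0.0."""
    least_liked = max(respondent["dislike"], key=respondent["dislike"].get)
    return 1.0 if respondent["allows"][least_liked] else 0.0


# Two invented respondents who mirror each other: each refuses to allow
# only the groups he or she dislikes most.
respondent_a = {
    "dislike": {"atheists": 9, "communists": 8, "fundamentalists": 2},
    "allows": {"atheists": False, "communists": False, "fundamentalists": True},
}
respondent_b = {
    "dislike": {"atheists": 2, "communists": 3, "fundamentalists": 9},
    "allows": {"atheists": True, "communists": True, "fundamentalists": False},
}

if __name__ == "__main__":
    for name, resp in (("A", respondent_a), ("B", respondent_b)):
        print(name, fixed_target_score(resp), least_liked_score(resp))
```

On the fixed list, respondent B looks fully tolerant and respondent A fully intolerant, even though both behave the same way toward the group they like least. The least-liked measure scores them identically, which is the logic behind the claim that fixed scales can build in a bias.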

However, even if scales were less biased, on average CPs would probably 
be less tolerant. The fact that CP intolerance does not appear to be ideologi- 
cally selective—they are as intolerant of militarists and racists (Wilcox & Je- 
len 1990, Tamney & Johnson 1997) as of atheists and homosexuals— suggests 
that the distinction between CPs and others may ultimately derive from competing
moral visions. Standing in the old tradition of political conservatism (à
la Burke), CPs appear to prioritize the conservation of what is in their view the 
moral basis of the “common good,” over the liberal ethic of tolerance. 


29The relationship between religiosity and prejudice is curvilinear. Those who are peripherally
involved in religion are the most prejudiced, and those who are heavily involved the least. People's
motivations for attending are also important (Gorsuch 1988, Kirkpatrick 1993). Although some
scholars question their motives and methods, CP groups like Promise Keepers have made
significant attempts to promote racial reconciliation (e.g., see Olsen 1997).
30These scales ask about attitudes toward homosexuals, atheists, communists, and feminists
(left-wing target groups), but not about people's attitudes toward fundamentalists, gender-role
traditionalists, etc. (right-wing target groups). They ask whether it is okay to have books that
advocate atheism in the public library but not about books that advocate conversion to Christianity.
Finally, however, even using the traditional tolerance scales, there is a significantly
higher variance in tolerance attitudes among CPs than in the general
population (Gay & Ellison 1993). This suggests that some subgroups within 
conservative Protestantism are highly tolerant and others are highly intolerant. 
Much of this intolerance seems to result from fundamentalism rather than re- 
ligious orthodoxy per se (Green et al 1994, Kirkpatrick 1993). Tolerance is 
also influenced by how much a group feels marginalized and threatened 
(Green et al 1994, Sullivan et al 1982, p. 251). 


Culture Wars? 


With the rise of the religious right (RR), many scholars and journalists have be- 
come convinced that America is involved in a culture war in which CPs play a 
leading role (e.g., Hunter 1991). However, at the grass roots this does not appear 
to be the case (C Smith et al 1998, Williams 1997, Manza & Brooks 1997, Di- 
Maggio et al 1996, Davis & Robinson 1996, Iannaccone 1993). After analyzing 
multiple attitudinal measures on the GSS and NES, DiMaggio et al (1996) con- 
clude that the attitudes of liberals and conservative religionists have actually 
converged since the 1970s. The only area of increased polarization they found is 
in abortion attitudes. Since the 1970s, liberals have become more pro-choice and 
more unified in their views; conservative religionists have become more inter- 
nally polarized (pp. 733—34). The gender-role and racial attitudes of both groups 
have also become more liberal and less polarized over time. The authors analyze 
a number of other possible cleavage lines in society, but only party ID showed 
increased cleavage (i.e., the attitudes of self-identified Democrats and Republicans
have become more polarized). This may have resulted because conservative
Southern Democrats have increasingly joined the Republican party, which
makes Democrats appear more liberal and Republicans more conservative. 

There is also little evidence of a massive increase in the number of CPs 
(Guth 1996a, T Smith 1992, Wuthnow 1992). Some people convert to CP de- 
nominations, but higher birth rates and greater retention of CP children cause 
much of the relative growth. However, this has been a slow, gradual process
(Guth 1996a, Finke & Stark 1992).

Moreover, according to Manza & Brooks (1997) white CPs have not become 
more Republican or increased their presidential voting behavior between 1960 
and 1992, although liberal Protestants have become increasingly Democratic. 
This seems to contradict the research of most other scholars (e.g., Kellstedt et 
al 1996d, Layman 1997). However, different starting years and methodologies 
seem to have caused these apparent contradictions.31 Manza & Brooks started


31Manza & Brooks divided Protestants into conservatives, moderates, and liberals. Kellstedt et
al divided them into evangelicals and mainliners. However, this does not seem to explain the
different results. Manza & Brooks' analysis shows no trend for either conservative or moderate
Protestants (which include the denominations in Kellstedt et al's evangelical category).
their analysis in 1960; most others start around 1980 (e.g., Layman 1997).
However, Manza & Brooks' analysis shows that CPs moved strongly into the 
Democratic party to support fellow evangelical Jimmy Carter in both the 1976 
and 1980 elections. Thus, this temporary Democratic shift is the anomaly, not
their gradual drift back into the Republican party. 

The second difference is the type of controls used. Manza & Brooks controlled
for the overall trend in presidential voting; the others did not. On average,
the general population voted more Republican in presidential elections
between 1960 and 1988; thus, in cross-tabulated analysis without controls, 
CPs seem to have become more Republican and mainline Protestants to have 
remained unchanged in their political loyalties (see Kellstedt et al 1996d). 
However, this does not necessarily contradict a proper reading of Manza & 
Brooks’ tables. According to their tables CPs have become no more Republi- 
can than the general population has, while liberal Protestants have resisted the 
general trend. Thus, if the CP trend does not differ significantly from the general
population, it seems unlikely that the RR had a major impact on CP presidential
voting or party ID, at least up to the 1992 elections. This interpretation
is reinforced by the fact that CP voter participation peaked in the 1976 and 
1980 Carter elections and has declined since (Manza & Brooks 1997).32
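
The effect of controlling for the overall trend can be shown with simple arithmetic. The sketch below is only an illustration of the logic: Manza & Brooks used regression-based controls, whereas this example just subtracts the population figure from each group's figure. All percentages are invented, and the function names are hypothetical.

```python
def gap_from_population(group, population):
    """Group's Republican vote share minus the population's, by year."""
    return {year: group[year] - population[year] for year in sorted(group)}


def change(series):
    """Difference between the last and first year of a series."""
    years = sorted(series)
    return series[years[-1]] - series[years[0]]


# Invented Republican vote shares (in percentage points) for two elections.
population = {1960: 50, 1988: 54}
conservative_protestants = {1960: 60, 1988: 64}
liberal_protestants = {1960: 70, 1988: 70}

if __name__ == "__main__":
    for label, group in (("CP", conservative_protestants),
                         ("liberal", liberal_protestants)):
        raw = change(group)
        net = change(gap_from_population(group, population))
        print(f"{label}: raw change {raw:+d}, change net of trend {net:+d}")
```

In this made-up case the uncontrolled comparison shows CPs moving toward the Republicans and liberal Protestants standing still, while the trend-adjusted comparison shows CPs unchanged relative to the population and liberal Protestants moving away from it. The same data therefore support both readings described above, depending on whether the general trend is removed.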

Why then do CPs have so much greater public visibility? First, CPs' increased
wealth and education have led to increased community involvement
and greater organizational skills. Now CPs as a whole are almost even with 
mainline Protestants in education, income, and middle-class identity (Guth 
1996a). Some groups of CPs always were middle-class and college educated, 
but others, like Pentecostals, have made significant recent gains. Second, some 
CPs have set up political pressure groups that attempt to speak for the CP com- 
munity and have mobilized activists to participate in the Republican party or- 
ganization (which previously was dominated by mainline Protestants). These 
organizations allowed previously marginalized groups to inject their concerns 
into the public arena (see Regnerus & Smith 1998). They also facilitate rapid 
dissemination of political information to CP constituents. As with the rise of 
Catholic involvement in politics earlier in this century, the new political visibility
of CPs has raised concern among older elites. Third, CPs are more visible
because of a restructuring of the political spectrum. Traditionally, Catholics,
Jews, minorities, and Southerners voted Democratic, and white non-
Southern Protestants voted Republican. However, according to Manza & 
Brooks (1997), liberal Protestants have moved strongly toward the Demo- 
cratic party and conservative Southerners into the Republican party. This has 
radically shifted party coalitions and rhetoric. Thus, whites, Jews, secularists, 


32This analysis focuses on presidential elections. CPs may have increased their voting in non-
presidential elections, or since 1992. Research by Green et al (1996a,b) suggests that this is the case.
and social liberals have become the new core of the Democratic party. This has
left more conservative Protestants as the core of the Republican party (Manza 
& Brooks 1997, Green et al 1996b, p. 4).33

According to Manza & Brooks, decomposition of the regression coeffi- 
cients suggests that this shift in the party alignment of liberal Protestants was 
caused by their increasingly liberal views on social issues, especially abortion 
(no doubt the rise of the RR helped speed them on their way). The shift of lib- 
eral Protestants and Southerners has increased the internal consistency of both 
parties on social issues and made it in the interest of party elites to manipulate
religious and regional issues to mobilize political support and demobilize op- 
position. Social issues may mobilize some CPs, but fear of the RR is also an ef- 
fective counter-mobilization tool (Wald 1995, Rozell & Wilcox 1995c). 


The Religious Right34


In the 1960s and 1970s, a number of CPs became increasingly concerned about 
the direction in which the United States was moving. Stricter enforcement of 
the separation of church and state, the legalization of abortion, the sexual revo- 
lution, the gay rights movement, the removal of religion from public educa- 
tion, the increasing sexual explicitness of TV and movies, the spread of por- 
nography, and challenges to traditional gender roles all raised concern (Guth 
1996a). These issues were strongly linked to the family and the socialization of
the next generation of believers (Davis & Robinson 1996, Oldfield 1996).35
Also, CPs’ growing socioeconomic status allowed them to develop stronger in- 
stitutional structures. 

However, the 1976 campaign of Jimmy Carter provided the initial spark. 
Carter was openly evangelical, and CPs moved strongly into the Democratic 
party to support him in the 1976 election. Despite common beliefs to the con- 
trary, CPs disproportionately supported Carter against Ronald Reagan in the 
1980 election (Woodberry et al 1996; also see Manza & Brooks 1997, Lipset & 
Raab 1981, p. 29).36 However, the 1976 election drew the attention of New
Right strategists (most of whom were not CPs) (Oldfield 1996, p. 100, Guth 


33This religious cleavage is even more evident among party elites than at the grass roots
(Kellstedt et al 1996d).
34For more extended summaries of research on the RR, see the introductory chapter of Rozell &
Wilcox 1996, Wilcox 1996, pp. 25–57, Oldfield 1996, and Guth 1996a.
35Activists often use the rhetoric of a persecuted minority (Oldfield 1996, pp. 31, 55; Reed
1994).

36Woodberry et al (1996) selected white CPs by denomination and then further separated
respondents by region and age into Northern and Southern CPs from three different age cohorts. In
1980, all groups except Northern baby-boomer CPs supported Carter at rates similar to those in
1976 (young Northerners seem the least likely to have been influenced by the RR). Mainline
Protestants were far more likely to abandon Carter. In fact, all mainline groups did, except Northern
baby-boomers (who increased in their support).
1996a). They realized that they could win the presidency and a substantial portion
of Congress with the CP vote and began to court its politically conservative
wing. At the same time, repeated intervention of the government in Southern
fundamentalist schools (many of which had grown significantly during desegregation),
and frequent inquiries by the Securities and Exchange Commission
and the IRS into the fund-raising of TV ministers, helped convince formerly
apolitical leaders like Jerry Falwell and Pat Robertson of the importance
of political involvement (Guth 1996a). New Right activists eagerly showed 
them the political ropes. This helps explain why the economic rhetoric of the 
RR is more conservative than that of its CP constituency. From its inception 
the RR has been just one part of a new right-wing political coalition (Iannac- 
cone 1993; also see Oldfield 1996, pp. 100, 218-19). 

Despite the attention the RR received in the press, it was initially very weak. 
The Moral Majority, the Religious Round Table, and The Christian Voice pre- 
dominantly mobilized fundamentalist ministers from the Sunbelt (Guth 
1996b).37 The Moral Majority was composed mostly of Independent Baptists
and some conservative Southern Baptists and Presbyterians. The leadership 
was almost exclusively pastors in Jerry Falwell's Baptist Bible Fellowship. 
Although mobilizing the ministers in his denomination allowed Falwell to list 
organizations in 47 states (which looked good on paper), most of the organiza- 
tions did very little. Many pastors were either not interested or too busy run- 
ning their churches to do much politically. The Moral Majority gave a total of 
only $25,000 to political candidates and had little grass-roots mobilization, 
even among CPs (Guth 1996a, Guth & Green 1996a, Georgiana 1989).38 A 
survey of contributors to 60 political action committees (PACs) shows that 
conservative political attitudes, not demographic or religious factors, best 
predicted support for the Moral Majority (Guth & Green 1996a). 

Nevertheless, these organizations collapsed within a few years. Initially 
many observers claimed the RR had died, but to their surprise it rose again with 
the presidential campaign of Pat Robertson in 1988. Robertson received few 
votes but surprised journalists and opponents with the size of his activist fol- 
lowing (Oldfield 1996, Green 1996). Other candidates raised more money, but 
he received contributions from more individuals than the rest of the Republi- 
can candidates combined. These supporters were primarily middle-class urban 
Pentecostals and charismatics from the Sunbelt, areas where secular and tradi- 
tional religious values are increasingly in tension (Green 1996). Those who 
voted for him were also primarily Pentecostals and charismatics (Smidt et al 
1996, Rozell & Wilcox 1995b, Oldfield 1996). However, Robertson failed to 


37Pentecostals were also an important element of the Christian Voice (Smidt et al 1996). 
38Even Jerry Falwell’s television audience was small, ranging from about 1,440,000 in 1981 to 
300,000 in 1988 (W Martin 1981, Winzenburg 1988). 
gain wide support even among CP clergy. A survey of ministers indicates that 
only about one tenth of Southern Baptist ministers and one fourth of Assem- 
blies of God ministers supported him (Green 1996).39 Still, his campaign was 
important because it tapped a largely unmobilized segment of CPs (Pentecos- 
tals and charismatics), established the first truly grass-roots RR organization, 
and created the foundation for the Christian Coalition (Green 1996, Smidt et al 
1996, Oldfield 1996). 

After his failed presidential campaign, Pat Robertson asked Ralph Reed to 
form the Christian Coalition with the remnants of his political organization. 
Reed attempted to broaden the coalition to include evangelicals, Catholics, Mor- 
mons, black CPs, and conservative Jews. He framed the movement in more 
moderate terms and laced his speeches and books with seemingly liberal language 
about the rights of women, condemnation of racism, and repentance for past 
wrongs perpetrated by CPs (e.g., Reed 1994, 1996). The Christian Coalition ac- 
tually established a grass-roots organization and has expanded the coalition to 
include a significant number of evangelicals and Mormons. Support by Catho- 
lics, blacks, and Jews is still minor (Green 1995, Rozell & Wilcox 1995b). As re- 
ligious activists have integrated into the political system and gained more ex- 
perience, they have lost some of their original militancy and become increas- 
ingly willing to compromise (Wilcox 1996, pp. 105-11, Moen 1995, Soper 
1995, Billings & Scott 1994). The RR now has strong influence in the Republi- 
can parties of Utah, Kansas, Nebraska, and ten Southern states. It has contested 
influence in Virginia, Arizona, Iowa, Minnesota, Idaho, and the five Pacific- 
rim states, and modest-to-weak influence elsewhere (Green et al 1998b). 

Despite impressive mobilization of activists, the RR's political impact has 
been minimal. Many of its victories have been symbolic, such as influencing 
the wording of the Republican party platform (Green & Guth 1996, Rozell & 
Wilcox 1995a). Most scholars now believe the RR will not disappear but will 
not become a dominant influence in politics either (Oldfield 1996, Wilcox 
1996, Guth 1996a, Rozell & Wilcox 1995a, Green 1995). 

There are several reasons for this. First, the RR has a limited constituency. 
Through the 1980s and 1990s, approximately 10% to 15% of survey respon- 
dents reported support for RR organizations.40 A somewhat larger group sup- 
ports specific issues on the RR's agenda; a much smaller number are active 


39The Assemblies of God is the largest Pentecostal denomination in the United States (Green 1996). 

40These percentages may be too high. According to T Smith (1996), when surveys ask about 
support for the "Religious Right," most respondents do not understand that this applies to a political 
movement. Support for the “Religious Right political movement” is much lower than for the 
"Religious Right." Even among supporters of the political movement, 47% had read little or 
nothing about it, and about 40% lacked awareness or admiration of even one of the major leaders 
(Pat Robertson, Jerry Falwell, or Ralph Reed). 
participants (Rozell & Wilcox 1995b; also see T Smith 1996, p. 27). Even in 
Virginia, only 15% of voters said endorsement by Falwell, Robertson, or the 
Christian Coalition would increase their support for a candidate; most said it 
would reduce it (Rozell & Wilcox 1995c, p. 125). Yet Virginia is the home of 
Falwell, Robertson, the Christian Coalition, and numerous other RR organi- 
zations. The RR has limited support even within the Christian religious com- 
munity (Jelen 1987, Guth & Green 1996a). Many (if not most) CP ministers 
remain solidly apolitical; only fundamentalist ministers seem to have re- 
sponded with enthusiasm to the RR's call (Guth 1996a, p. 12). Even among 
Southern Baptist ministers, about half think political activism usually hurts the 
church (another third are not sure), and half think some Southern Baptist lead- 
ers have gone too far in mixing religion and politics (Guth 1996b). Many peo- 
ple who welcome religious people into the public square still fear the intoler- 
ance of RR leaders. Although a number of RR issues have broad support in the 
general public, openly identifying with the RR usually hurts candidates politi- 
cally (Gilbert & Peterson 1995, Wald 1995). RR organizations and leaders are 
considerably less popular than the messages they intend to convey (Jelen 
1987). 

Second, there is a residual antipolitical bias among many CPs. Even with 
demographics controlled, CPs as a whole are still less likely to vote in presi- 
dential elections than are all other religious groups (Manza & Brooks 1997). 
Many CPs fear politics will dilute the church's spirituality, destroy internal 
harmony, debase its moral authority, and divert attention from evangelism 
(Guth et al 1996b, Oldfield 1996). Large groups like the National Association 
of Evangelicals and influential evangelical magazines like Christianity Today 
have supported selected RR initiatives but have resisted endorsing candidates 
or involvement in elections, and they have consistently counseled moderation 
(Guth 1996a). Third, both CPs and the RR are internally divided, with many 
competing organizations, leaders, and agendas (Green 1995).41 CP organiza- 
tions can be found on multiple sides of most political issues. 

Fourth, the RR is isolated from many of the mainstream culture-shaping 
institutions.42 Few Baptist, Pentecostal-Holiness, or other CPs are among the 
"power elite," few are CEOs, and few are listed in Who's Who (Pyle 1996, 
Davidson et al 1995, Manza & Brooks 1997). Few CPs are in the movie or TV 
industries (Rothman & Lichter 1984, Lichter et al 1983); few work at promi- 
nent newspapers, magazines, or TV newsrooms (Jelen & Wilcox 1995, 
Lichter & Rothman 1981), and few work in academia—especially in the hu- 
manities and social sciences (Jelen & Wilcox 1995, Wuthnow 1985, Gorsuch 


41See McAdam (1982) for how a similar complexity of organizations hampered the 
effectiveness of the Civil Rights Movement after the mid-1960s. 
42They do have access to “subcultural shaping” institutions of their own. 
1988).43 Many of these sources also suggest that these elites are more socially 
liberal than the general public and especially than CPs. Thus, the RR has one 
type of power (i.e., votes, donations, activism), but RR activists have little 
power to determine how their movements will be framed in the national media 
or to influence which people and statements will be highlighted to represent 
their causes.44 Their attempts to censor or influence textbook and media con- 
tent must be done in public through boycotts and letter campaigns; they cannot 
do these things behind the scenes, or make others censor themselves through 
their control of promotions and resources. 

Still the RR enjoys something like a veto power in the Republican party. On 
their own, the constituents of the RR cannot determine the Republican presi- 
dential candidate, and any candidate too closely tied to the movement would 
probably lose. But the Republicans would have difficulty nominating a candi- 
date they directly oppose. Thus, Republican candidates must at least give lip 
service to their cause.45 

The RR has primarily been successful where its activists have been part of a 
broad coalition of allies, when their role is not highlighted, where state and 
party organizations have less control over committee memberships, candi- 
dates, and interest group activities, and where they have a mass constituency of 
CPs and other conservatives (Green et al 1998b, Guth & Green 1996a, Green et 
al 1996a, Wald 1995). Thus, they have gained substantial influence on the Re- 
publican party in the South and in Utah. However, in most other places RR ac- 
tivism arouses intense opposition (e.g., California, Oregon, and Minnesota). 
Because of the high commitment of CP activists, they are also influential in 
low-turnout elections and caucuses (where political participation has greater 
cost) (Oldfield 1996). 


Why Conservative Churches Grow 


Dean Kelley's 1972 book, Why Conservative Churches Are Growing, demon- 
strated that CP churches tend to thrive numerically, while more liberal 
churches languish. Since then, scholars have advanced several theories seek- 
ing to explain why. For example, status discontent approaches focus on threats 


43In Jelen & Wilcox's academic sample (professors of political science, sociology, history, and 
English), 90% thought "Evangelicals/Religious Right" have too much power, 86% had the 
strongest view of the separation of church and state, and 75% thought "Evangelicals/Religious 
Right" are a threat to democracy (only 36% mentioned "Nazis/KKK/Racist Right"). 

44Conflict may be accentuated because RR organizations use boycotts and letter campaigns to 
challenge the economic interests of the media conglomerates that produce many television shows, 
movies, and news programs. 

45Ronald Reagan did this skillfully by, for example, promoting a constitutional amendment 
allowing school prayer, but only after he knew there were not enough Congressional votes for it to 
pass (Sider 1997). 
to the social or economic status of religious groups. They suggest that religious 
identities become more salient, commitments more firm, and resources more 
easily mobilized, the more a religious group feels its social status is threat- 
ened. Originally developed to explain right-wing political extremism, and 
largely unfashionable among contemporary sociologists, versions of status 
discontent theory still find occasional proponents among scholars of religion 
(e.g., Wald et al 1989, Lorentzen 1980, Page & Clelland 1978; see Thurow 
1996, p. 232). 

Another theoretical approach, “strictness” theory, focuses on the differen- 
tial impact of the microlevel normative demands religious groups impose on 
their members. In short, “strict” religious groups thrive, while “lenient” relig- 
ious groups decline. Kelley (1972, 1978) argued that religions that deliver sub- 
stantial meaning to their adherents thrive. Religions produce meaning by de- 
manding that their followers respond to their beliefs by committing their time, 
money, energy, reputations, and selves in a way that validates and invests in 
those ideas. “Meaning = concept + demand” (1972, p. 52). More recently, Ian- 
naccone (1992, 1994) suggests that strict religions thrive because they screen 
out free-riders (i.e., people who enjoy many of the benefits of the religious 
group while contributing little to the group). Strict churches demand that their 
members contribute their fair share of time, money, and emotional energy to 
generate the collective religious goods all enjoy. Strict CP churches that screen 
out free-riders enjoy higher degrees of commitment, solidarity, and mutual 
rewards, all of which make them thrive and grow. 

A third theory of religious vitality, based on the economic model of rational 
choice, is called the “religious economies" or “supply side” theory (Finke & 
Stark 1988, 1989, 1992, Finke & Iannaccone 1993, Finke et al 1996). This the- 
ory claims that religious regulation and monopolies create lethargic religions, 
but pluralistic, competitive environments allow entrepreneurial religious 
groups to thrive. In these environments, religious “firms” (denominations and 
traditions) that possess superior organizational structures (denominational 
polities), sales representatives (evangelists and clergy), products (religious 
messages), and marketing (evangelistic techniques) flourish (Finke & Stark 
1989). Those that cannot successfully compete, decline. CP vitality, then, is a 
result of unregulated religious environments that stimulate aggressive CP re- 
ligious entrepreneurs to promote more diverse religious products that satisfy 
ever-expanding markets of religious consumers. 

Finally, seeking to elaborate sociology of religion's “new paradigm" (War- 
ner 1993) in cultural and structural rather than economistic terms, Smith et al 
(1998) advance a “subcultural identity" theory of evangelical church vitality. 
It suggests that in a pluralistic society, those religious groups will be relatively 
stronger that better possess and employ the cultural tools needed to create both 
clear distinction from, and significant engagement and tension with, other rele- 
vant outgroups (short of becoming genuinely countercultural). Smith et al 
(1998) argue that evangelicalism possesses precisely these cultural tools. For- 
mulated in contrast to older modernization-secularization interpretations of 
evangelicalism (e.g., Hunter 1983, 1987), the subcultural identity theory at- 
tempts to demonstrate how and why American evangelicalism thrives within its 
modern, pluralistic environment. The elaborated theory suggests that the human 
drives for meaning and belonging are satisfied primarily by locating human 
selves within social groups that sustain distinctive, morally orienting identities; 
that social groups construct and maintain collective identities by drawing sym- 
bolic boundaries between themselves and relevant outgroups; that modern be- 
lievers establish stronger religious identities and commitments through individ- 
ual choice than through ascription; that people define their values and evaluate 
themselves in relation to specific reference groups; that modern pluralism pro- 
motes the formation of strong subcultures and potentially deviant identities; 
and that inter-group conflict in a pluralistic context typically strengthens in- 
group identity, solidarity, resources mobilization, and membership retention. 

Theories of CP organizational vitality remain contested (see Marwell 1996, 
Bibby 1978, Chaves 1989, Hunter 1987, pp. 203-6, Perrin & Mauss 1993), and 
more empirical research is needed to evaluate their usefulness. 


CONCLUSION 


Conservative Protestantism represents at least a quarter of the US population 
and significantly influences its adherents' attitudes and behaviors. Conceptual 
problems and measurement difficulties have obscured the importance of CP 
faith in many previous sociological studies. We suggest that greater clarity in 
definitions, understanding of history, use of sophisticated measurement tools, 
and attention to complex and nuanced qualitative data should significantly en- 
hance our understanding of conservative Protestantism and its social signifi- 
cance in the years ahead. This type of research is already beginning to alter our 
understanding of CPs' gender-role attitudes, childrearing strategies, tolerance, 
and political behavior. 
ABSTRACT 


Initial sociological interest in network forms of organization was motivated 
in part by a critique of economic views of organization. Sociologists sought 
to highlight the prevalence and functionality of organizational forms that 
could not be classified as markets or hierarchies. As a result of this work, we 
now know that network forms of organization foster learning, represent a 
mechanism for the attainment of status or legitimacy, provide a variety of 
economic benefits, facilitate the management of resource dependencies, and 
provide considerable autonomy for employees. However, as sociologists 
move away from critiquing what are now somewhat outdated economic 
views, they need to balance the exclusive focus on prevalence and function- 
ality with attention to constraint and dysfunctionality. The authors review 
work that has laid a foundation for this broader focus and suggest analytical 
concerns that should guide this literature as it moves forward. 


INTRODUCTION 


Over the past decade or so, sociological interest in network forms of organiza- 
tion has blossomed. Sociologists have become increasingly intrigued by the 
plethora of organizational configurations that fail to conform to traditional 
definitions of markets or hierarchies. Part of the interest in these alternative or- 
ganizational arrangements is no doubt due to what some regard as their in- 
creased empirical prevalence (Kanter 1991). 
While a number of scholars have convincingly challenged the view that 
these forms are more prevalent now than at other times in history (e.g., Claw- 
son 1980, Granovetter 1995, Laumann 1991), it nonetheless remains true that 
changes in the US regulatory environment greatly facilitated the ability of US 
firms to engage in cooperative activities with their market competitors. For ex- 
ample, the National Cooperative Research Act enabled coordinated research 
and development activity across firm boundaries to an extent that had not been 
allowed in the past. Such regulatory changes were themselves a consequence 
of another empirical phenomenon in the 1980s that also increased scholarly in- 
terest in these network forms of organization: the worldwide competitive suc- 
cess of Japanese and, to a lesser extent, other Asian firms. Because Japanese 
firms seemed to rely extensively on network forms of organization, there 
emerged great interest on the part of both scholars and practitioners in under- 
standing the extent to which that reliance was itself a determinant of competi- 
tive success (Lincoln et al 1996, Gerlach 1992, Orr et al 1991). 

Yet, if part of the motivation was empirical, another part was that the exis- 
tence, prevalence, and functionality of these organizational arrangements rep- 
resented a challenge to economic views of organization that were becoming 
popular during this time period (Granovetter 1985). Prior to the middle 1970s, 
economists had largely regarded the organization as a black box that is to be 
understood as a production function converting inputs to outputs. In the middle 
1970s and early 1980s, economists started to look inside the black box, and 
two perspectives in particular became quite prominent: principal-agent theory 
and transaction cost economics. At least when they first emerged, each per- 
spective was grounded in a dichotomous view of economic organization: mar- 
kets, on the one hand, and hierarchies, on the other. 

While this dichotomous view was perhaps more implicit than explicit in the 
principal-agent tradition, it was quite explicit in transaction cost economics. 
For example, whereas Oliver Williamson, one of the leading figures of the 
transaction cost perspective, acknowledged that other forms of organization 
existed, he nonetheless asserted two points. First, the alternatives to pure mar- 
kets and pure hierarchies can be interpreted as intermediate or hybrid forms, 
combining elements of markets and hierarchies (Williamson 1991). Second, 
the distribution of organizations along the markets-hierarchies continuum is 
“thick in the tails" (Williamson 1985). That is, pure types tend to prevail over 
the mixed forms. 

Sociological research on network forms of organization sought to challenge 
both of these points. First, sociologists argued that network forms of organiza- 
tion could not be considered hybrids of markets or hierarchies; rather, network 
forms of organization represented a unique alternative possessing its own 
logic (Powell 1990). Second, sociologists argued that the network form of or- 
ganization has a number of distinct efficiency advantages not possessed by 
pure markets or pure hierarchies, and because of these efficiency advantages, 
network forms are quite prevalent (Bradach & Eccles 1989). 

In the following pages, we review and highlight a number of important in- 
sights into the nature and functionality of the network form of organization. 
At the same time, we question whether the emphasis on the functionality of 
the network form has perhaps gone too far. In elucidating functions, sociolo- 
gists are prone to neglect constraints that underlie the formation of network 
forms of organization, problems that arise in their governance, and boundary 
conditions on their functionality. We may have a good understanding of why 
economic actors want to utilize network forms of organization, but we have 
less understanding of why they do not. That is, we have little understanding of 
the reasons why variance exists in the utilization of network forms of organi- 
zation or why a given focal actor would pursue one network partner and not 
another. 


WHAT IS A NETWORK FORM OF ORGANIZATION? 


From a purely structural perspective, the trichotomy among market, hierarchy, 
and network forms of organization is a false one. Markets and hierarchies are 
simply two pure types of organization that can be represented with the basic 
network analytic constructs of nodes and ties (Laumann 1991). For example, 
one might operationalize a spot market as a population of isolates. Each market 
actor is a node that lacks any ties to the other actors/nodes. One could opera- 
tionalize a hierarchy as a centralized network in which the vast majority of ties 
flow to or from one particular node. In effect, from a structural perspective, 
every form of organization is a network, and market and hierarchy are simply 
two manifestations of the broader type. 
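
The structural point can be illustrated with a small sketch. The Python below is an illustration under stated assumptions, not a procedure taken from Laumann (1991): it stores an organization as a list of undirected ties among named nodes, and the helper names (`degree`, `is_spot_market`, `hub_share`) and the example actors are hypothetical. A spot market appears as a population of isolates, and a hierarchy as a network in which nearly all ties touch one node.

```python
from collections import Counter

def degree(nodes, ties):
    """Count the ties touching each node (ties are unordered pairs)."""
    counts = Counter({n: 0 for n in nodes})
    for a, b in ties:
        counts[a] += 1
        counts[b] += 1
    return counts

def is_spot_market(nodes, ties):
    """Structurally, a spot market is a population of isolates."""
    return all(d == 0 for d in degree(nodes, ties).values())

def hub_share(nodes, ties):
    """Share of all ties that touch the single best-connected node."""
    if not ties:
        return 0.0
    return max(degree(nodes, ties).values()) / len(ties)

# Hypothetical four-actor examples.
actors = ["a", "b", "c", "d"]
market = []                                        # no ties at all
hierarchy = [("a", "b"), ("a", "c"), ("a", "d")]   # every tie touches "a"
```

Under these assumptions, `is_spot_market(actors, market)` holds, and `hub_share(actors, hierarchy)` equals 1.0 because every tie flows to or from the same node. Intermediate values of `hub_share` would correspond to less centralized arrangements; where to draw the line for "the vast majority of ties" is a judgment the text leaves open.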

However, when considered as a form of governance, the network form can 
be distinctly characterized. We define a network form of organization as any 
collection of actors (N ≥ 2) that pursue repeated, enduring exchange relations 
with one another and, at the same time, lack a legitimate organizational 
authority to arbitrate and resolve disputes that may arise during the exchange. 
In a pure market, relations are not enduring, but episodic, formed only for the 
purpose of a well-specified transfer of goods and resources and ending after 
the transfer. In hierarchies, relations may endure for longer than a brief epi- 
sode, but a clearly recognized, legitimate authority exists to resolve disputes 
that arise among actors. 
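
The governance definition can be read as a simple decision rule. The sketch below is one hedged reading of it, not a coding scheme proposed in the literature: the `Arrangement` fields and the `governance_form` function are hypothetical names, the size condition is read as at least two actors, and any arrangement with a legitimate dispute-resolving authority is treated as a hierarchy regardless of how long its relations last (the text says only that such relations "may endure").

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Arrangement:
    n_actors: int         # number of exchanging actors
    enduring: bool        # repeated, enduring exchange relations?
    has_authority: bool   # legitimate authority to resolve disputes?

def governance_form(arr):
    """Classify an arrangement as 'hierarchy', 'network', or 'market'."""
    if arr.n_actors < 2:
        raise ValueError("an exchange relation needs at least two actors")
    if arr.has_authority:
        return "hierarchy"   # assumption: authority is the defining feature
    if arr.enduring:
        return "network"     # enduring relations, no arbitrating authority
    return "market"          # episodic relations, no arbitrating authority
```

For instance, a two-firm alliance with enduring relations and no arbitrating authority, `Arrangement(2, True, False)`, is classified as a network, while the same pair trading once on a spot basis, `Arrangement(2, False, False)`, is classified as a market. The rule works on observed features of an exchange rather than on formal labels, so a formally internal arrangement that lacks a price-setting authority could still come out as a network.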

This definition of a network form of organization includes a wide array of 
joint ventures, strategic alliances, business groups, franchises, research con- 
sortia, relational contracts, and outsourcing agreements. This definition ex- 
cludes most pure market arrangements such as short-term contracts or spot 
market transactions, and it excludes employment relations. Yet, while it is 
tempting to provide a list of formal organizational arrangements that can or 
cannot be categorized as network forms of organization, any such list would 
obscure important variance within formal organizational types. For example, 
consider syndicates—collections of (typically financial) actors that pool assets 
to support a high-risk endeavor in exchange for profits from that endeavor. 
Syndicates can differ in the extent to which authority for managing the syndi- 
cate is vested in one actor, and they can differ in terms of their time horizon. In 
fact, an important trend among investment banks in the twentieth century was 
the increasingly transitory character of syndicates for the underwriting of se- 
curities (Carosso 1970, Eccles & Crane 1987). Such decentralization affected 
the extent to which this particular interorganizational arrangement conformed 
to the definition of the network form. 

More generally, a significant sociological finding is that many of the eco- 
nomic arrangements that are formally labeled markets and hierarchies may 
conform empirically to the definition of network organization laid out above. 
For example, in his work on transfer pricing within organizations, Eccles 
(1985) discusses a number of cases in which organizational divisions lack a le- 
gitimate authority to set the price for the internal transfer of goods and arrive at 
prices based on negotiation. Therefore, while we provide a list of formal or- 
ganizations typically regarded as manifestations of network forms of organiza- 
tion for the sake of illustration, we emphasize that there is no clear mapping of 
formal organizational arrangements onto the network form. 

Even though network forms of organization cannot be identified according 
to some limited set of labels for formal organizational arrangements, a number 
of scholars have argued that network forms of organization can be character- 
ized by a distinct ethic or value-orientation on the part of exchange partners. In 
his analysis of long-term buyer-supplier relations among Japanese firms, Ron- 
ald Dore (1983) points to what he calls the "spirit of goodwill" underlying 
these relationships. The central elements of this spirit of goodwill are a com- 
mitment to use "voice" rather than "exit" (cf. Hirschman 1970) to resolve dis- 
putes and a high level of trust between the parties. The buyer tries to work with 
the seller to address any deficiencies in the seller's performance rather than 
simply moving to another seller. Buyer and seller are both willing to make 
relationship-specific investments without contractual guarantees protecting 
those investments because each party expects that the other will not use the 
relationship-specific investments to its own advantage. Similarly, Powell 
(1990) argues that a norm of reciprocity is a guiding principle underlying net- 
work forms of organization. Each member of the network feels a sense of obli- 
gation to the other party or parties rather than a desire to take advantage of any 
trust that may have been established. In his analysis of business groups, Gra- 
novetter (1995) also points to a high level of trust and obligation among mem- 
bers of the group. He argues that a distinctive feature of such groups is that 
they constitute a moral community insofar as “trustworthy behavior can be ex- 
pected, normative standards understood, and opportunism foregone.” Finally, 
in a treatise on what he calls “small firm networks," Perrow (1993) identifies 
trust as a critical element of small firm production networks. 

Probably the most vivid illustration of trustworthiness and obligation in a 
network form of organization comes from Uzzi's (1997) examination of sub- 
contracting relationships in the New York garment industry. Uzzi is particu- 
larly interested in what he refers to as “embedded ties," strong enduring rela- 
tions between manufacturers and subcontractors. He describes the case of a 
manufacturer that had decided to move all of its production facilities overseas 
to Asia. Because of this upcoming move, the manufacturer would no longer be 
relying on its subcontractors in New York. He writes: 


As a result [of this move], this manufacturer had strong incentives not to tell 
its contractor that it intended to leave. Doing so put it at risk of receiving low- 
quality goods from contractors who now saw the account as temporary and 
had to redirect their efforts to new manufacturers who could replace the lost 
business. Yet the CEO of this manufacturer personally notified his embed- 
ded ties, because his relationships with them obliged him to help them adapt 
to the closing of his business, and his trust in them led him to believe that they 
would not shirk on quality. Consistent with his account, one of his contrac- 
tors said that the jobber's personal visit to his shop reaffirmed their relation- 
ship, which he repaid with quality goods. This same manufacturer, however, 
did not inform those contractors with which it had arms-length ties. (Uzzi 
1997, p. 55) 


This example is noteworthy because there exists no shadow of the future to en- 
sure cooperation in the present. Moreover, because the manufacturer is mov- 
ing overseas, it has no need to preserve its local reputation. Cooperation does 
not arise as a route to future gains. 

While there may be subtle differences in each author's understanding of the 
trusting ethic guiding economic exchange in network forms of organization, 
these subtle differences need not concern us here. What is important is that this 
more trusting ethic is one of the defining elements of a network form of gov- 
ernance, and the network form of governance is therefore not reducible to a hy- 
bridization of market and hierarchical forms, which, in contrast, are premised 
on a more adversarial posture. 

To be sure, it is probably true that a moral community or spirit of goodwill 
is not a functional necessity for a network form of organization to exist. If two 
economic actors wish to enter into an enduring relation and lack a legitimate 
authority to resolve disputes, they may enter into a long-term contract in order 
to place restrictions on the opportunistic behavior of one another. In the con- 
tract, they can include provisions that allow for anticipated changes or allow 
for recontracting at a later date based on unanticipated changes. However, in 
the face of unexpected changes to the opportunities and constraints confront- 
ing parties to the exchange, an exchange relation governed by a contract with 
provisions only for anticipated changes will generally be less flexible than an 
exchange governed by a norm of reciprocity. Moreover, a contract allowing 
for recontracting at a later date based on unanticipated changes to circum- 
stance requires some level of trust that the other party will act in good faith at 
the time of recontracting. In short, while a long-term contract may represent a 
substitute for what some have identified as a moral community, spirit of good- 
will, or norm of reciprocity, such a contract is not likely to allow for the same 
flexibility and adaptability as these ethics of exchange. 


FUNCTIONS OF NETWORK FORMS OF ORGANIZATION 


An increasingly large volume of research has sought to highlight the function- 
ality of network forms of organization. Sociologists and organizational schol- 
ars have claimed that network forms allow participating firms to learn new 
skills or acquire knowledge, gain legitimacy, improve economic performance, 
and manage resource dependencies. In addition, the widespread use of net- 
work forms of organization may have unintended social welfare benefits. We 
consider each of these proposed advantages separately. 


Learning 


A number of scholars have emphasized the learning benefits of network forms 
of organization (Dore 1983, Powell 1990, Uzzi 1997, Hamel 1991). Network 
forms of organization foster learning because they preserve greater diversity 
of search routines than hierarchies and they convey richer, more complex in- 
formation than the market. As Powell (1990) writes, “the most useful informa- 
tion is rarely that which flows down the formal chain of command in an organi- 
zation, or that which can be inferred from price signals. Rather, it is that which 
is obtained from someone you have dealt with in the past and found to be reliable”
(p. 304).

There are two ways in which network forms of organization can foster 
learning. First, they can encourage learning by promoting the rapid transfer of 
self-contained pieces of information. In this view, network ties are conduits or 
channels (e.g., Contractor & Lorange 1988b, Root 1988, Hamel 1991, Kogut 
1988b). Hamel (1991) is perhaps the most explicit in examining how interfirm 
collaborations provide participating firms with opportunities to internalize one 
another's skills. Conceiving firms as portfolios of skills, Hamel argues that 
network forms of organization are less a compromise between market and hi- 
erarchy (Grant 1996), to use Williamson's (1975) terminology, and more an 
alternative to other modes of skill acquisition. This understanding of learning 
through networks is quite consistent with some of the early network research
on information transfer. For example, in Granovetter's (1974) seminal research
on job search, information on jobs resides at nodes and is transferred
through the ties linking nodes. 

Alternatively, as Powell & Brantley (1992) contend, network forms of or- 
ganization may foster learning by encouraging novel syntheses of information 
that are qualitatively distinct from the information that previously resided 
within the distinct nodes. That is, rather than simply facilitating the transfer of
information between two nodes, the existence of an enduring exchange rela- 
tion may actually yield new knowledge. In effect, the network becomes the lo- 
cus of innovation rather than the nodes that comprise the network. 

In a study of the biotechnology industry, Powell et al (1996) attempt to test 
empirically the claim that when the knowledge of an industry is broadly dis- 
tributed and rapidly changing, the locus of innovation will be found in interor- 
ganizational networks of learning, rather than in individual firms. In their 
study, the authors find some evidence of a liability of unconnectedness; 
strong-performing biotechnology firms have larger, more diverse alliance net- 
works than do weak-performing firms. While this result is consistent with the 
authors' hypothesis, it is also consistent with a number of others. First, the link 
between connectedness and performance does not necessarily mean that learn- 
ing and innovation constitute the intervening process between structure and 
performance. As we discuss further below, network ties may serve a number of 
other functions such as managing resource dependencies or enhancing legiti- 
macy, both of which have positive effects on performance. Second, even if the 
tie count reflects learning, it is not clear whether ties are conduits for informa- 
tion flow or are actually loci of innovation that would not arise in the absence 
of the ties. 

Using a firm's position in a network of patent citations as a measure of tech- 
nological position, Stuart & Podolny (1996, 1997) attempt to establish a closer 
connection between alliances and learning. The authors illustrate how patent 
citations can be used to establish (a) the technological distance of a firm from 
its alliance partners and (b) the extent to which a firm's current inventions differ
considerably in content from its past inventions. Stuart & Podolny (1997)
find that the greater a firm's technological distance from its alliance partners, 
the higher the likelihood that the focal firm produces inventions that are con- 
siderably different in content from its previous inventions. Such a finding pro- 
vides more direct support for the learning hypothesis, but it does not distin- 
guish between the two means of learning identified above. 

To know whether alliances yield novel syntheses, we would need to know 
not only whether a firm's inventions were significant departures from past in- 
ventions but also whether the firm's inventions were qualitatively distinct 
from its alliance partners' past inventions. If alliances yield inventions that are 
qualitatively distinct from the inventions of either partner, then we can more
confidently assert that the locus of innovation is in the network itself rather 
than the nodes of that network. While such research has not yet been con- 
ducted, it seems a straightforward extension of the current work using patent 
citations as proxies for technological distance. 


Legitimation and Status 


A number of scholars have argued that if an actor’s partner in a network form 
of organization possesses considerable legitimacy or status, then the actor may 
derive legitimacy or status through the affiliation. This legitimacy or status 
may in turn have a number of positive economic benefits for the actor, ranging 
from survival to organizational growth to profitability. For example, in a study 
of daycare centers, Baum & Oliver (1992) find that a tie to a legitimate institu- 
tional actor, such as a church or governmental entity, has a positive effect on 
the life chances of an organization. In a study of the investment banking industry,
Podolny & Phillips (1996) find that the higher the status of a bank's management
partners in underwriting syndicates at time t, the greater its status
growth between time t and t + 1. This enhanced status, in turn, has positive economic
advantages for the organization (Podolny 1993).

In one of the more compelling demonstrations of the economic value of ties
to legitimate or high-status actors, Stuart et al (1997) examine the economic ef- 
fects of the interorganizational networks of privately held biotechnology 
firms. These authors find that an affiliation with a prominent alliance partner in- 
creases the market value of the biotechnology firm. What is particularly notable 
about this study is that the authors seek to empirically disentangle the legitimat- 
ing or status-enhancing effects of these ties from the resources that would flow 
from such ties. The authors argue that the legitimating or status-enhancing ef- 
fects of an affiliation with a prominent actor should vary with the age of the 
start-up. When a start-up is young, there is considerable investor uncertainty 
about its quality. As the start-up ages, this uncertainty inevitably declines since 
investors have more history on which to base their inferences. If ties to promi- 
nent actors are primarily symbolic in their significance, then the effect of such 
ties should be greatest for the young start-ups, whose quality is most uncertain. 
Conversely, if the ties to prominent actors are simply proxies for superior re- 
source flows, then the effect should not vary with age; older and younger firms 
should benefit equally from superior resource flows. Consistent with an inter- 
pretation of these ties as carriers of legitimacy, Stuart and his associates find 
that the effect of affiliations varies inversely with the age of the start-up. 

Finally, Stark (1996) offers one of the most intriguing accounts of the legiti- 
mating effects of network ties. Stark examines the development of organiza- 
tional forms in postsocialist Hungary. In this transitional period, multiple so- 
cioeconomic and sociopolitical orders exist simultaneously, with different and 
sometimes contradictory bases on which organizations can lay legitimate
claims to resources. Decentralized networks of organizations emerge, and assets
and liabilities are reallocated within the network in such a way that the network
represents a hedge against uncertainty in the political and economic environment.
In Stark’s account, status and legitimacy are acquired not by virtue of
a tie to an actor that is generally regarded as high status or legitimate. Rather, 
legitimacy is attained through a distribution of liabilities and assets within the 
organizational network that is robust with respect to the multiple, contending 
sociopolitical and socioeconomic orders. 

Whereas all of the above accounts of legitimacy emphasize benefits that 
flow from one network partner to another, Baum & Oliver (1991, 1992) make 
the additional claim that a focal organization's tie to a legitimate actor has 
positive externalities for others in the focal organization's population. A tie to
a legitimate actor outside of the organizational population helps to institution- 
alize the population as a whole. Sharfman et al (1991) make a similar argu- 
ment. 


Economic Benefits 


In elaborating functions fulfilled by the network form of organization, it is im- 
portant not to overlook the direct economic benefits of this form in terms of 
costs and quality. Williamson (1991) lays out conditions under which network 
forms of organization lower transaction costs, though it bears repeating that 
the transaction cost perspective does not see trusting or altruistic behavior as 
particularly germane to the network form of organization. A number of economists
and strategy scholars attempt to assess empirically the relevance of the
transaction cost perspective to the network form of organization (Hennert 
1988, 1991, Zajac & Olsen 1993, Parkhe 1993, Buckley & Casson 1988, 
Stuckley 1983). Perhaps because they seek to elaborate a view of network 
forms of organization that is distinct from economic views, sociologists down- 
play or reject the role of transaction costs in the adoption of the network form 
(e.g., Powell et al 1996, Lazerson 1993, Bradach & Eccles 1989). Moreover, 
when sociologists see transaction costs reduced through the network form, 
they emphasize the reliance on trust rather than contractual provisions as the 
primary basis on which transaction costs are reduced (e.g., Dore 1983). 

Perhaps more importantly, sociologists stress quality advantages rather 
than costs as the primary economic benefit. For example, comparing long-term 
or embedded subcontracting relations to arms-length subcontracting relations, 
Uzzi (1997) argues that the former are more conducive to high-quality produc- 
tion because they enable richer communication between buyer and supplier on 
issues pertaining to quality. 

Some sociologists also claim that one of the economic benefits of the network
form of organization is the adaptability of this form to unanticipated environmental
changes (Powell 1990, Kanter 1991). By fostering greater communication
than the market does, network forms of organization facilitate
greater coordination in the face of changes whose significance cannot be com- 
pletely conveyed or understood through price signals. At the same time, be- 
cause the boundaries of network forms of organization are generally easier to 
adjust than the boundaries of hierarchies, it is easier to modify the composition 
of network organizations to respond to those changes (Sorenson 1997). 


Other Benefits of the Network Form of Organization 


In addition to the benefits just listed, at least two other advantages are empha- 
sized by sociologists and organizational scholars. First, following Selznick's 
(1949) initial insights regarding organizational cooptation, resource depend- 
ence scholars posit that organizations can alleviate sources of external con- 
straint or uncertainty by strengthening their relationship with the particular 
sources of dependence. Pfeffer & Nowak (1976) apply this general insight to 
the formation of joint ventures. They find that oligopolistic industries (i.e.,
industries of intermediate concentration) have the highest proportion of firms 
engaged in within-industry joint ventures. Since oligopolistic industries are 
those in which firms face the highest uncertainty about the actions of their 
competitors, Pfeffer & Nowak take this finding as evidence that joint ventures 
are a means for reducing that uncertainty. Second, Perrow (1993) identifies a 
number of social welfare benefits with what he refers to as small firm net- 
works, or networks of small producers. He argues that in comparison to larger, 
bureaucratic forms of organization, small firm networks provide individuals 
with greater autonomy, lead to less inequality in the distribution of wealth, and 
foster a greater sense of community. 


WHY ARE THERE MARKETS AND HIERARCHIES? 


While this list of functions or benefits of the network form of organization is 
perhaps not exhaustive, it captures the vast majority of those that sociologists 
and organization scholars have identified. This research has yielded some im- 
portant insights. Yet, a review of the work does raise an important question: 
Why do not all actors within an organizational population rely exclusively on 
the network form? That is, if a network form of governance can result in supe- 
rior learning, enhanced legitimacy and prestige, greater control over the exter- 
nal environment, and economic benefits, why are there any markets and hierar- 
chies remaining? In effect, this attention to the functionality of network forms 
of organization explains why economic actors rely on network forms of or- 
ganization, but it does not explain why they do not. 

To be sure, this work potentially provides some understanding of cross-industry
or cross-population variance in the utilization of network forms of organization.
For example, to the extent that the network form fosters learning, the
form should be more prevalent in industries where knowledge is broadly dispersed
and rapidly updated (Powell & Brantley 1992). Based on
this argument, one would expect the form to be more prevalent in the biotechnology
industry than in the steel industry, for example.

But what explains the variance in the utilization of this form within the biotechnology
industry? More generally, what are the determinants of intrapopulation
or intraindustry variance in the utilization of network forms of organization?
This concern with intrapopulation variation in the utilization of network
forms of organization can be framed as two research questions. First, what de- 
termines the extent to which an economic actor chooses to rely on a network 
form of governance? Second, to the extent that an economic actor wishes to 
employ a network form of governance, what are the constraints on the pattern 
of network relationships that the actor may form? 

In an examination of relationships between corporations and investment 
bankers, Baker (1990) attempts to answer this first question. Baker distin- 
guishes three market interfaces that can link an investment bank to its corporate 
issuers: a relationship interface involving long-term ties with at most a few 
banks; a transaction interface involving short-term ties with numerous banks; 
and a hybrid interface representing an intermediate category between the other 
two. The relationship interface obviously corresponds to the network form of 
governance, whereas a transactional interface corresponds to market govern- 
ance. Using a number of measures of a corporation’s power, such as its size or its 
availability of resource alternatives, Baker finds that a more transactional orien- 
tation is associated with greater corporate power. Only those corporations that 
are weaker and more dependent on investment banks adopt the relationship in- 
terface. 

These findings are important insofar as they provide some initial insight 
into why actors would adopt a network form of organization. However, further 
work must be done to integrate these findings with the literature emphasizing 
the functionality of the network form of governance. In Baker's work, corporations
only adopt the network form when they are too weak to adopt an alternative,
more transactional interface. If network forms of organization are functional
for the reasons elaborated above, then it is difficult to understand why
only weak corporations would prefer this form. One possibility is that powerful
firms are in less need of the benefits yielded by alliances. However, given
the broad set of advantages claimed for the network form of governance, this
proposition seems difficult to sustain. Another possibility is that unique features
of the investment banking industry limit the benefits of the network form
of organization. This conclusion, too, is difficult to sustain in light of research 
highlighting functional benefits of the relationship interface in this industry 
(Eccles & Crane 1987). 

While not necessarily directed toward the specific topic of network forms
of organization, ecological arguments on inertia (Hannan & Freeman 1989)
may nonetheless provide some analytical leverage in understanding intrapopulation
variance in the adoption of this form. Drawing on Stinchcombe's (1965)
arguments about organizational imprinting, ecologists argue that important 
features of an organization's structure are established early in an organiza- 
tion's history, and these features can be difficult to alter. For example, older 
firms in the computer industry such as IBM or DEC are vertically integrated. 
At the time of their entry into the computer industry, such firms had strong 
functional reasons for a high level of vertical integration. An absence of effi- 
cient markets for various components of computers drove firms to integrate 
vertically. In contrast, younger computer firms such as Sun Computers, Com- 
paq, and Silicon Graphics are vertically disintegrated, relying extensively on 
outsourcing relationships. These younger firms emerged during a time when 
efficient markets existed for many of the various components within comput- 
ers. Because of the difficulties involved in significant organizational restruc- 
turing, older firms have been either unable or unwilling to modify their verti- 
cally integrated structures to take advantage of the more efficient markets for 
computer components. 

A third basis for variance in the propensity to adopt the network form of or- 
ganization is nationality. Cultural and legal differences across countries can be 
the basis for differences in the propensity of organizations to adopt network 
forms of organization. Dore (1983), for example, argues that the ethic underly- 
ing a network form of organization is more consistent with a collectivist orien- 
tation and thus more prevalent in countries where individuals subscribe to the 
collectivist orientation. In Italy, tax laws favor small employers. Such laws 
greatly encourage the formation of small-firm networks over the formation of 
large hierarchies. 

Power, conditions at time of founding, and nationality have thus been iden- 
tified as three factors that potentially affect intrapopulation or intraindustry 
variance in the adoption of the network form. Yet, of these three factors, only 
two can explain within-country variance, and of these two, one is difficult to 
reconcile with the view that network organizations are—at least at times— 
functional and desirable rather than a response to a weakened power position.
Moreover, the organizational imprinting argument does not by itself explain 
why network forms were more prominent in some periods than in others. Fur- 
ther research obviously needs to be undertaken to explain differences in the 
propensity of economic actors to adopt the network form of organization. 

We now shift from work focusing on the propensity of an actor to adopt a 
network form to work that seeks to explain the pattern of network ties that arise 
among a population of actors. Some research suggests that the pattern of rela- 
tions follows a functional logic. That is, a network tie arises when it is most 
likely to foster one of the functions listed above. For example, Lincoln et al
(1992) examine transaction cost and resource dependence motivations under- 
lying the pattern of ties among leading Japanese firms. In addition, implicit in 
Hamel's (1991) assertion that firms form alliances to obtain skills is a hypothe- 
sis that a firm will form alliances with those that are most able to provide the 
skill set needed by the firm. 

Though there is value in linking the functions of the network form of or- 
ganization to the pattern of ties that arise among a population of actors, a cen- 
tral feature of the sociological perspective is its attention to constraints on ac- 
tion that lie outside of the purposive behavior of individual actors. Even if a 
particular network tie would convey one of the functional advantages listed 
above, there may still be reasons why a dyad would be unable to form an en- 
during exchange relation in the absence of a legitimate authority. Neverthe- 
less, despite the centrality of the constraint emphasis to the sociological per- 
spective, there is surprisingly little work that highlights these constraints. 

Mowery et al (1996) draw on Cohen & Levinthal's (1989, 1990) concept of 
absorptive capacity to explain the pattern of technology-sharing alliances. 
They contend that a firm's ability to absorb knowledge from a potential partner 
is contingent on the stock of related knowledge. Therefore, a firm is unlikely to 
enter into an alliance with another firm whose technology is highly different 
from its own. At the same time, the authors assert that a firm is unlikely to enter 
into an alliance with a partner that possesses redundant technology. Accord- 
ingly, the authors hypothesize and find evidence that a firm is most likely to 
form technology-sharing alliances with a firm whose technology is at an inter- 
mediate technological distance. 

Podolny (1994) examines the pattern of syndicate relations among invest- 
ment banks. He argues that in markets where there is high uncertainty about 
the quality of a good or service that an actor brings to market, an actor's status 
may limit the potential exchange partners to which the actor has access. High 
status actors must avoid affiliating with low status actors in order to avoid 
risking a loss of their own status, and low status actors are thereby constrained 
in their ability to enter into exchange relations with high status actors. 

Gulati (1995) offers probably the most general account of the constraints 
underlying the pattern of network tie formation. In a multi-industry examination
of alliance formation, he argues that one of the most important determinants
of the pattern of alliances at time t is the preexisting pattern of alliances at
t - 1. More specifically, Gulati hypothesizes that the probability that the members of a
dyad will enter into an alliance with one another is a function of past direct 
contact between the pair and the presence of indirect network connections 
through others in the industry. Gulati asserts and finds evidence that these indi- 
rect ties serve both a referral and a control function. They provide information 
on each potential partner’s reliability, and they represent a source of peer sanctioning
when one party does not act in good faith in the context of the alliance.
Powell et al (1996) also develop some similar hypotheses as to how the pre- 
vious pattern of ties predicts the existing patterns of ties. Framing these results 
in terms of constraint, a firm will generally be unwilling or unable to form a tie 
with another if it lacks some indirect connection to that other. 

Of course, any endogenous explanation of tie formation, in which ties at 
one time period lay a foundation for ties at a subsequent time period, raises the
question: What are the determinants of the initial ties in which a firm is involved?
Especially in rapidly growing industries like biotechnology, where so
many new organizations enter each year without ties, the presence of previous
direct or indirect ties cannot be a critical explanatory variable in understanding
the distribution of ties within a population. Gulati & Gargiulo (1997) have 
started to respond to this concern by situating the endogenous dynamic in a 
broader evolutionary framework. 

As work on constraints underlying the pattern of network relations moves 
forward, it will be important to attend to two observations. First, the pattern is a 
function not only of the formation of new ties but of the persistence of estab- 
lished ties, and, second, constraints operate not only on tie formation but also 
on tie persistence. For example, consider Gulati’s finding that a firm’s ability 
to form an alliance with another is contingent on the presence of previously ex- 
isting indirect ties to that firm. To the extent that this claim is true, a firm has an 
incentive to preserve an alliance simply to help lay the foundation for future al- 
liances with others even if that focal alliance provides no tangible benefit it- 
self. Moreover, given the importance of trust and obligation to the successful 
operation of network ties, there are inevitably limits on how much an actor can 
alter its network in response to changes in self-interest (Portes & Sensenbren- 
ner 1993). 

The existence of constraints on the breaking of ties has clear, specific impli- 
cations for the previously mentioned claims regarding the adaptability of net- 
work forms of organization. If there are reputational costs from breaking ties, 
then there are at least some circumstances in which market and hierarchical 
forms of governance, which are not premised on trust and obligation, will be 
more adaptable than a network form, which is. More generally, if there are
constraints on the dissolution of ties, then it seems quite reasonable to con- 
clude that network ties can often outlive the duration of their functionality. A 
network tie that originated for strongly functional reasons may come to persist
only for the purpose of preserving reputation.


WHY ARE THERE NO NETWORK FAILURES? 


This discussion of constraints on the dissolution of network ties naturally leads 
to another topic that has received scant attention in sociological research on
network forms of organization: the dysfunctionalities that arise through the
operation of the network. One possible reason for the inattention to dysfunc- 
tionalities is that network forms of organization may indeed constitute a su- 
perior organizational form. A number of scholars (e.g., Kanter 1991, Powell 
1990, Perrow 1993) seem at least implicitly to make the claim that alliance 
capitalism is simply more efficient and effective than a capitalism premised on 
arms-length transactions among large hierarchies, especially when efficiency 
and efficacy hinge on the coordination of a complex array of elements. 

However, this enthusiasm for network forms of organization seems difficult
to reconcile with an important fact: An extremely large fraction of network
forms of organization do not perform the function for which they were
designed (Kogut 1988a, Killing 1982, Inkpen 1996). While there are essentially
no scientific studies of the failure rate of network forms of organization,
journalistic and managerial sources are nearly unanimous in the conclusion
that, for at least one common type of network organization (strategic
alliances), an extremely large proportion result in failure. For example, the Boston
Consulting Group, which undertook a study of the performance of alliances in
the airline industry (an industry with 401 alliances in 1995), estimated that
fewer than 40% of regional alliances and fewer than 30% of international alliances
should be considered successes (The Economist 1995). Similarly, Savona
(1992) refers to a study finding that the average joint venture lasts less
than 3.5 years and that fewer than one third of joint ventures are considered
successes.

Obviously, such findings must be supplemented with more scientific analy- 
ses. Enthusiasts for the network form of organization could perhaps respond to 
such a finding by observing that the dysfunctionality of a particular form is 
relative, and one would need to compare the failure rate of alliances to the fail- 
ure rate of other organizational structures. Fair enough. Such an analysis 
would clearly be of value to establishing the relative performance of the net- 
work form of governance. 

However, even in the absence of such a study, it seems important to pay 
more attention to conditions under which network forms of organization meet 
their objective and those under which they do not. While there is some ac- 
knowledgment that networks do fail (e.g., Powell & Smith-Doerr 1994), such 
acknowledgments have not yet had a significant impact on the focus of empiri- 
cal research. We suspect that one of the reasons that there has been little work 
on this topic is that it is quite difficult to obtain longitudinal data on the performance
of a population of network organizations. Moreover, for network
forms such as strategic alliances, data on dates of founding are generally much 
better than on the dates of failure. Failure rarely occurs on a specific date, and 
even if an alliance is formally terminated on a particular date, the participating 
actors often do not publicly announce the termination. As a consequence, researchers
cannot rely on archival data to establish failures.

One initial attempt to reconcile the concern with functionality and dysfunctionality
is Uzzi (1996, 1997). He argues that embedded transactions are more
functional than arms-length transactions, though he posits an inverted U- 
relationship between embeddedness and performance. That is, while embed- 
ded transactions are superior to unembedded ones, it nonetheless remains pos- 
sible for an organization to depend too much on embedded ties. If a dispropor- 
tionate number of an organization's ties are embedded, then the organization 
becomes trapped by these relationships. However, even here, dysfunctionality 
is at the level of an actor's entire network rather than at the level of individual 
ties, and the above data on failure are at the tie level, not at the level of the en- 
tire network. 

Some research suggests that the ability to operate in a network form of or- 
ganization is a skill or capability that must be learned, and as a consequence 
the likelihood of failure is related to the experience of the actors with the form. 
Acknowledging the difficulties inherent in working with unrelated entities, 
Powell & Brantley note that “successful firms are those who learn most rapidly 
how to gain from external linkages without creating enemies or behaving op- 
portunistically” (1992, p. 371). That is, the ability to exploit the substantive 
knowledge gained through network relationships without killing the prover- 
bial goose can be viewed as an important capability in its own right (Levitt & 
March 1988), to be learned through experience in network forms of organiza- 
tions (Powell et al 1996, Mody 1993, Gulati et al 1994, Gulati 1995, Westney 
1988, Balakrishnan & Koza 1993). The implication of this research is that the 
likelihood that a network organization will fail decreases with the partners’ ex- 
perience with the form. 

In addition to this general insight on the importance of experience, some 
qualitative field research provides clues and insights into the behavioral deter- 
minants of success and failure at the dyad level (e.g., Doz 1996, Liebeskind et 
al 1996, Parkhe 1991, Larson 1992). For example, Doz (1996) argues that a 
number of factors such as the level of task integration, similarity of organiza- 
tional cultures, and commonality of organizational goals affect the ability of 
alliance partners to learn from one another. Parkhe (1991) also looks at learn- 
ing and adaptation as critical processes underlying the longevity and effective- 
ness of alliances, focusing in particular on how the firms’ diversity affects 
learning and adaptation. 

Unfortunately, with the exception of Larson’s (1992) study, these field 
studies tend to fall outside of the sociological literature. As a result, they do not 
link up with the theoretical constructs that have been of greatest interest to sociologists.
Further work needs to be done to establish a more substantial linkage.
Especially given the importance of trust in sociological accounts, it would 
be valuable to have some more direct insight into the social and psychological 
processes by which trust in network forms of organization is built up and 
breaks down. Though their analysis is more at the level of interpersonal net- 
works than network organizations, the work of Burt & Knez (1995) provides a 
model for such an examination. 

One could perhaps draw on some of the research on constraints to develop 
hypotheses about dysfunctionalities. For example, as noted above, Gulati 
(1995) demonstrates that the probability that an alliance will form between 
two actors is a function of the indirect connections that these actors have to one 
another. These indirect connections are conduits for information about reputa- 
tion and peer control. We suspect that these should be relevant not only to the 
likelihood of formation but also to the likelihood of failure. That is, less infor- 
mation about reputation and less peer control increase the likelihood that a 
strategic alliance will end in failure. 

Yet, regardless of the direction pursued in future research, it is clear that 
more attention must be given to the factors that determine the success and fail- 
ure of network forms of organization. Once the possibilities for failure are ac- 
knowledged, one can no longer simply add up the number of network ties and 
assume that more ties imply greater learning or greater legitimacy. It seems 
quite plausible to assume that a failed tie with a high status or legitimate actor 
may have more adverse consequences than no tie at all. 


CONCLUSION: A BALANCED APPROACH TO THE 
NETWORK FORM 


A large volume of research has documented the functionality of the network 
form of organization. As noted in the introduction, we suspect that the initial 
impetus for this concern with the functionality of the network form was to cri- 
tique and challenge economic views of organization, as is made quite explicit 
in the writings of Granovetter (1985) and Powell (1990). When explicitly 
linked to a critique, the primary objective was to show that at least under some 
conditions, nonmarket, nonhierarchical forms of organization are functional. 
However, as the literature has evolved, it has become decoupled from such an 
explicit critique. Such a decoupling seems a necessary and important stage in 
the evolution of this literature; however, in moving away from the explicit cri- 
tique, researchers must counterbalance the focus on prevalence and function- 
ality with an equally strong focus on constraint and dysfunctionality. Other- 
wise, the literature runs the risk of succumbing to a naive functionalism. 

In moving toward this more balanced consideration of the network form, it 
is important to recognize that the network form represents one of three alterna- 
tive forms of governance, not one of two. In the past, sociologists have typi- 
cally made pairwise comparisons when evaluating network organizations. For 
example, when discussing the richness of information conveyed through network
ties, the comparison is to arms-length contracts. When discussing the 
autonomy of the network form, the comparison is to hierarchy. However, does 
the network form provide richer information than hierarchies and more auton- 
omy than the market? Only by considering all three forms simultaneously can 
objective assessments of the strengths and weaknesses of the form be made. 


ACKNOWLEDGMENTS 
The authors wish to thank Woody Powell for helpful comments on this review. 







Literature Cited 


Baker WE. 1990. Market networks and corporate
behavior. Am. J. Sociol. 96:589-625 

Balakrishnan S, Koza MP. 1993. Information 
asymmetry, market failure and joint ventures:
theory and evidence. J. Econ. Behav. Org.
20:99-117 

Baum JAC, Oliver C. 1991. Institutional linkages
and organizational mortality. Admin.
Sci. Q. 36:187-218 

Baum JAC, Oliver C. 1992. Institutional embeddedness
and the dynamics of organizational
populations. Am. Sociol. Rev. 57:540-59 

Bourdieu P, Coleman JS, eds. 1991. Social 
Theory for a Changing Society. Boulder, 
CO: Westview 

Bradach JL, Eccles RG. 1989. Price, authority, 
and trust: from ideal types to plural forms. 
Annu. Rev. Sociol. 15:97-118 

Buckley PJ, Casson M. 1988. A theory of cooperation
in international business. See 
Contractor & Lorange 1988a, pp. 31-53 

Burt RS, Knez M. 1995. Kinds of third-party
effects on trust. Ration. Soc. 7:255-92 

Carosso VP. 1970. Investment Banking in 
America: A History. Cambridge, MA: Harvard
Univ. Press 

Clawson D. 1980. Bureaucracy and the Labor 
Process: The Transformation of US Industry,
1860-1920. New York: Monthly 
Rev. 

Cohen WM, Levinthal DA. 1989. Innovation 
and learning: the two faces of R&D. Econ. 
J. 99:569-96 

Cohen WM, Levinthal DA. 1990. Absorptive 
capacity: a new perspective on learning 
and innovation. Admin. Sci. Q. 35:128-52 

Contractor FJ, Lorange P, eds. 1988a. Cooperative
Strategies in International Business.
Lexington, MA: Lexington Books 

Contractor FJ, Lorange P. 1988b. Why should 
firms cooperate: the strategy and economic 
basis for cooperative ventures. See Contractor
& Lorange 1988a, pp. 3-30 

Dore R. 1983. Goodwill and the spirit of market
capitalism. Br. J. Sociol. 34:459-82 

Doz YL. 1996. The evolution of cooperation 
in strategic alliances: initial conditions or 
learning processes? Strat. Manage. J.
17:55-83 (Summ. Suppl.) 

Eccles RG. 1985. The Transfer Pricing Problem.
Lexington, MA: Lexington Books 

Eccles RG, Crane DB. 1987. Managing 
through networks in investment banking. 
Calif. Manage. Rev. 30:176-95 

Editors. 1995. Flying in formation. The 
Economist 336(7924):59-60 

Gerlach ML. 1992. Alliance Capitalism: The 
Social Organization of Japanese Business. 
Berkeley: Univ. Calif. Press 

Granovetter MS. 1974. Getting a Job: A Study 
of Contacts and Careers. Cambridge, 
MA: Harvard Univ. 

Granovetter MS. 1985. Economic action and 
social structure: the problem of embeddedness.
Am. J. Sociol. 91:481-510 

Granovetter MS. 1995. Coase revisited: business
groups in the modern economy. Ind. 
Corp. Change 4:93-131 

Grant RM. 1996. Prospering in dynamically-competitive
environments: organizational 
capability as knowledge integration. Org. 
Sci. 7:375-87 

Gulati R. 1995. Social structure and alliance 
formation patterns: a longitudinal analysis. 
Admin. Sci. Q. 40:619-52 

Gulati R, Gargiulo M. 1997. Where do networks
come from? Work. Pap. Northwestern
Univ. Grad. Sch. Mgmt. 

Gulati R, Khanna T, Nohria N. 1994. Unilateral
commitments and the importance of 
process in alliances. Sloan Manage. Rev. 
35(3):61-69 

Hamel G. 1991. Competition for competence 
and inter-partner learning within international
strategic alliances. Strat. Manage. J. 
12:83-103 (Summ. Suppl.) 

Hannan MT, Freeman J. 1989. Organizational 
Ecology. Cambridge, MA: Harvard Univ. 
Press 

Hennart J. 1988. A transaction costs theory of 
equity joint ventures. Strat. Manage. J. 
9:361-74 

Hennart J. 1991. The transaction costs theory 
of joint ventures: an empirical study of 
Japanese subsidiaries in the United States. 
Manage. Sci. 37:483-97 

Hirschman AO. 1970. Exit, Voice, and Loyalty:
Responses to Decline in Firms, Organizations,
and States. Cambridge, MA: 
Harvard Univ. Press 

Inkpen AC. 1996. Creating knowledge 
through collaboration. Calif. Manage. Rev. 
39:123-40 

Kanter RM. 1991. The future of bureaucracy 
and hierarchy in organizational theory: a 
report from the field. See Bourdieu & 
Coleman 1991, pp. 63-90 

Killing JP. 1982. How to make a global joint 
venture work. Harvard Bus. Rev. 60.3 

Kogut B. 1988a. A study of the life cycle of 
joint ventures. Manage. Int. Rev. 28:39-52 

Kogut B. 1988b. Joint ventures: theoretical 
and empirical perspectives. Strat. Manage. 
J. 9:319-32 

Larson A. 1992. Network dyads in entrepreneurial
settings: a study of the governance of 
exchange relationships. Admin. Sci. Q. 37:76-104 

Laumann EO. 1991. Comment on "The Future
of Bureaucracy and Hierarchy in 
Organizational Theory: A Report from the 
Field." See Bourdieu & Coleman 1991, pp. 
90-93 

Lazerson M. 1993. Future alternatives of work 
reflected in the past: putting-out production
in Modena. In Explorations in 
Economic Sociology, ed. R Swedberg, pp. 
403-28. New York: Russell Sage Found. 

Levitt B, March JG. 1988. Organizational 
learning. Annu. Rev. Sociol. 14:319-40 


Liebeskind JP, Oliver AL, Zucker LG, Brewer 
M. 1996. Social networks, learning, and 
flexibility: sourcing scientific knowledge 
in new biotechnology firms. Org. Sci. 7:428-43 

Lincoln JR, Gerlach ML, Ahmadjian C. 1992. 
Keiretsu networks in the Japanese economy:
a dyad analysis of intercorporate ties. 
Am. Sociol. Rev. 57:561-85 

Lincoln JR, Gerlach ML, Ahmadjian C. 1996. 
Keiretsu networks and corporate performance
in Japan. Am. Sociol. Rev. 61:67-88 

Mody A. 1993. Learning through alliances. J. 
Econ. Behav. Org. 20:151-70 

Mowery DC, Oxley JE, Silverman BS. 1996. 
Strategic alliances and interfirm knowledge
transfer. Strat. Manage. J. 17:77-91 
(Summ. Suppl.) 

Orru M, Biggart NW, Hamilton GG. 1991. 
Organizational isomorphism in East Asia. 
In The New Institutionalism in Organizational
Analysis, ed. WW Powell, PJ DiMaggio,
pp. 361-89. Chicago: Univ. Chicago 

Parkhe A. 1991. Interfirm diversity, organizational
learning, and longevity in global 
strategic alliances. J. Int. Bus. Stud. 22:579-601 

Parkhe A. 1993. Strategic alliance structuring: 
a game theoretic and transaction cost examination
of interfirm cooperation. Acad. 
Manage. J. 36:794-829 

Perrow C. 1993. Small firm networks. In Explorations
in Economic Sociology, ed. R 
Swedberg, pp. 277-402. New York: Russell
Sage Found. 

Pfeffer J, Nowak P. 1976. Joint-ventures and 
interorganizational interdependence. Admin.
Sci. Q. 21:398-418 

Podolny JM. 1993. A status-based model of 
market competition. Am. J. Sociol. 98:829-72 

Podolny JM. 1994. Market uncertainty and the 
social character of economic exchange. 
Admin. Sci. Q. 39:458-83 

Podolny JM, Phillips DJ. 1996. The dynamics 
of organizational status. Ind. Corp. 
Change 5:453-72 

Portes A, Sensenbrenner J. 1993. Embeddedness
and immigration: notes on the social 
determinants of economic action. Am. J. 
Sociol. 98:1320-50 

Powell WW. 1990. Neither market nor hierarchy:
network forms of organization. In Research
in Organizational Behavior, ed. B 
Staw, LL Cummings, 12:295-336. Greenwich,
CT: JAI 

Powell WW, Brantley P. 1992. Competitive 
cooperation in biotechnology: learning 
through networks? In Networks and Organizations:
Structure, Form and Action, 
ed. N Nohria, R Eccles, pp. 366-94. Boston:
Harvard Bus. Sch. 

Powell WW, Koput KW, Smith-Doerr L. 
1996. Interorganizational collaboration 
and the locus of innovation: networks of 
learning in biotechnology. Admin. Sci. Q. 
41:116-45 

Powell WW, Smith-Doerr L. 1994. Networks 
and economic life. In The Handbook of 
Economic Sociology, ed. NJ Smelser, R 
Swedberg, pp. 368-402. Princeton, NJ: 
Princeton Univ. Press 

Root RR. 1988. Some taxonomies of international
cooperative arrangements. See Contractor
& Lorange 1988a, pp. 69-80 

Savona D. 1992. When companies divorce. 
Int. Bus. 5(11):48-51 

Selznick P. 1949. TVA and the Grass Roots. 
Berkeley, CA: Univ. Calif. Press 

Sharfman MP, Gray B, Yan A. 1991. The context
of interorganizational collaboration in 
the garment industry: an institutional perspective.
J. Appl. Behav. Sci. 27:181-208 

Sorenson O. 1997. Complexity catastrophe: 
interdependence and adaptability in organizational
evolution. Work. Pap. Univ. 
Chicago Grad. Sch. Bus. 

Stark D. 1996. Recombinant property in East 
European capitalism. Am. J. Sociol. 101:993-1027 

Stinchcombe A. 1965. Social structures and 
organizations. In Handbook of Organizations,
ed. JG March, pp. 142-93. Chicago: 
Rand McNally 

Stuart TE, Hoang H, Hybels R. 1997. Interorganizational
endorsements and the performance
of entrepreneurial ventures. 
Work. Pap. Univ. Chicago Grad. Sch. Bus. 

Stuart TE, Podolny JM. 1996. Local search 
and the evolution of capabilities. Strat. 
Manage. J. 17:21-38 

Stuart TE, Podolny JM. 1997. Positional 
causes and consequences of strategic alliances
in the semiconductor industry. 
Work. Pap. Univ. Chicago Grad. Sch. Bus. 

Stuckley JA. 1983. Vertical Integration and 
Joint Ventures in the Aluminum Industry. 
Cambridge, MA: Harvard Univ. 

Uzzi B. 1996. The sources and consequences 
of embeddedness for the economic performance
of organizations: the network effect.
Am. Sociol. Rev. 61:674-98 

Uzzi B. 1997. Networks and the paradox of 
embeddedness. Admin. Sci. Q. 42:35-67 

Westney DE. 1988. Domestic and foreign 
learning curves in managing international 
cooperative strategies. See Contractor & 
Lorange 1988a, pp. 339-46 

Williamson OE. 1975. Markets and Hierarchies.
New York: Free 

Williamson OE. 1985. The Economic Institutions
of Capitalism. New York: Free 

Williamson OE. 1991. Comparative economic 
organization: the analysis of discrete structural
alternatives. Admin. Sci. Q. 36:269-96 

Zajac EJ, Olsen CP. 1993. From transaction 
cost to transactional value analysis: implications
for the study of interorganizational 
strategies. J. Manage. Stud. 30:131-45 


Annu. Rev. Sociol. 1998. 24:77-103 
Copyright © 1998 by Annual Reviews. All rights reserved 


REACTIONS TOWARD THE NEW 
MINORITIES OF WESTERN EUROPE 


Thomas F. Pettigrew 


University of California, Santa Cruz, California 95064; 
e-mail: PETTIGR@CATS.UCSC.EDU 


KEY WORDS: new minorities, blatant prejudice, subtle prejudice 


ABSTRACT 


Millions of ex-colonials, “guest workers,” refugees, and other immigrants 
have settled in western Europe during recent decades. Extensive research on 
this phenomenon broadens sociology’s understanding of intergroup relations 
in industrial societies. Unlike African Americans, these new Europeans are 
often viewed as not “belonging,” and gaining citizenship can be difficult. 
The chapter discusses four major reactions to the new minorities: prejudice, 
discrimination, political opposition, and violence. Both blatant and subtle 
forms of prejudice predict anti-immigrant attitudes. And between 1988 and 
1991, a hardening took place in these attitudes. Similarly, direct and indirect 
discrimination against the new minorities is pervasive. Moreover, anti-discrimination
efforts have been largely ineffective. Far-right, anti-immigration
political parties have formed to exploit this situation. These 
openly racist parties have succeeded in shifting the political spectrum on the 
issue to the right. In addition, violence against third-world immigrants has 
increased in recent years, especially in nations such as Britain and Germany 
where far-right parties are weakest. The chapter concludes that these phe- 
nomena are remarkably consistent across western Europe. Furthermore, the 
European research on these topics supports and extends North American re- 
search in intergroup relations. 


INTRODUCTION 


The world is experiencing two major intergroup trends—massive migration 
and increased group conflict. An estimated 80 million migrants, almost 2% of 
the world’s population, live permanently or for long periods of time outside 
their countries of origin (Castles 1993, p. 18). And headlines of intergroup 
strife fill our newspapers. 

These trends are especially evident in western Europe (Solomos & Wrench 
1993, Thraenhardt 1992a). Somalis in London's East End (Griffiths 1997) and 
Cypriot entrepreneurs in the city’s garment industry (Panayiotopoulos 1996), 
Russian Jews in Berlin (Doomernik 1997), Peruvian house servants in Barce- 
lona (Escriva 1997), Senegalese street vendors in Italian cities (Campani 
1993)—every western European city reveals the arrival of immigrants over re- 
cent decades. And every western European nation has seen harsh, often vio- 
lent, reactions to these new minorities. 

An extensive research literature has developed on these groups. This chap- 
ter outlines this work with an eye toward enlarging the sociological under- 
standing of intergroup processes. Such a comparison is important for Ameri- 
can sociology. The discipline has focused on black-white relations in the 
United States. This situation is atypical of the world's intergroup situations on 
many dimensions. African Americans endured two centuries of slavery and 
another of legal segregation. They still face intense racial barriers. They re- 
main the most residentially segregated and have the lowest intermarriage rates 
with whites of any American minority (Pettigrew 1988). Nonetheless, African 
Americans "belong" in the United States (Landes 1955). Not even racists 
question their citizenship. Moreover, they share a language, religion, and a na- 
tional culture with other Americans. Indeed, they are major contributors to the 
most distinctive elements of American culture. 

In short, the position of African Americans is vastly different from that of 
Europe's new minorities. Yet it is the American black-white situation upon 
which much of sociology's study of intergroup dynamics rests. Hence, current 
scholarship on the unfolding scene of majority-minority relations in western 
Europe offers a welcome opportunity to broaden our perspective. Though only 
a ninth of the chapter's citations are from non-English literatures, works in 
English by leading European scholars help to compensate. 


THE NEW MINORITIES 


A Rich Variety of Groups and Contexts 


The variety of new minorities within contrasting national contexts enhances 
the comparative value of intergroup phenomena in western Europe. The new 
Europeans come from Africa, Asia, the Caribbean, the Middle East, and South 
America. And they typically have cultural backgrounds sharply different from 
those of their host nations. Seven million, for instance, are Muslims (Peach & 
Glebe 1995). 

This is not an entirely novel experience for the continent. There were mass 
movements of people after World War I and following the Russian Revolution 
(Kulischer 1948). And western Europe has long had indigenous minori- 
ties—such as the Frisians of the Netherlands and Germany, the Bretons and 
Corsicans of France, the Scots and Welsh of Great Britain, and the Basques 
and Catalans of Spain (Foster 1980). But the new minorities offer a more cul- 
turally diverse intergroup situation than the traditionally emigrating continent 
has experienced. 


FOUR DECADES OF IMMIGRATION Driven by both economic opportunities and 
the decline of European empires, colonial minorities began arriving during the 
1950s. Before independence of their native lands, French colonials were 
French citizens and began coming in growing numbers to France for greater 
opportunities. In Great Britain, London transport and other employers re- 
cruited West Indians for low-wage jobs. While only 2,000 immigrated from 
the islands in 1952, 26,441 came in 1956. By late 1959, Britain's West Indian 
population numbered 126,000 (Rich 1990, pp. 181, 188). 

An especially troubled group were the South Moluccans. Prized soldiers of 
the Dutch East Indian Army, they had fought to maintain Dutch colonization. 
When Indonesia won independence in 1948, many of these soldiers and their 
families migrated to the Netherlands. But, upon arrival, the Dutch decommis- 
sioned them. Stripped of their specialty, many Moluccans became unem- 
ployed and remain today dependent on welfare. Their dream of returning to a 
sovereign South Molucca heightens their plight. Their island is now firmly in 
Indonesia's grip, and their dream has retarded their adjustment to Dutch soci- 
ety. 

The 1960s saw the arrival of contract workers who were not colonials. 
Many of these misnamed "guest workers" were Europeans. Spanish and Portu- 
guese came to France; Italians to France, Germany, and Belgium; Yugoslavs 
and Greeks to Germany; and Turks to the Netherlands and Germany. North 
Africans came soon after to France and the low countries. There were eco- 
nomic and other push factors as well as economic pull factors. Portuguese 
men, for example, avoided induction into their nation's colonial armies fight- 
ing to maintain African colonies. 

Rapid industrial expansion in western Europe in the 1960s fueled the 
worker recruitment. West Germany, undergoing its “economic miracle," des- 
perately needed more workers. It made recruitment treaties with Italy (1955), 
Spain (1960), Turkey (1961—1964), Morocco (1963), Portugal (1964), Greece 
(1965), Tunisia (1965), Yugoslavia (1968), and even South Korea (1968) 
(Thraenhardt 1992b, p. 25). Almost 35,000 North Africans entered France 
each year during this decade (Creamean 1996, p. 51). Indeed, most western 
European countries took part in such recruitment efforts in this period. 

The boom years ended with rising oil prices in the 1970s and consequent 
unemployment. Labor recruitment abruptly stopped, and governments devel- 
oped schemes to encourage the "guest workers" to leave. Yet their numbers 
fell only slightly. By the 1980s, the new minorities were again growing in size 
from three sources: family reunion, the high birth and low death rates of their 
young populations, and increasing numbers of refugees. By 1995, resident foreign
populations ranged from 3.6% in the United Kingdom to 18.9% in Switzerland.
More than half of these foreigners are from non-European Union 
countries (Waldrauch & Hofinger 1997, p. 274). 


DIFFERENT STATUSES Today the new minorities hold an array of statuses. 
We distinguish seven types. (For a detailed scheme, see Husbands 1991a.) 


1. The most favored are the national migrants—those considered citizens who 
are seen as returning “home.” The special case of the “Saxons” from Roma- 
nia illustrates the extremes this social construction can assume. Though 
separated by eight centuries from Germany, these Aussiedler “return” with 
full citizenship automatically granted them (McArthur 1976, Verdery 1985, 
Wilpert 1993). 

2. Citizens of European Union (EU) countries living in other EU countries also 
are a favored class. Though "foreigners," they have full rights under EU 
agreements. Of 13 million foreign residents in western Europe in 1993, six 
million were western Europeans (Muenz 1996, p. 211). Hence, they often 
constitute a large segment of a nation's foreign residents—such as the Por- 
tuguese in France and Italians in Belgium (Martiniello 1992a, 1993; 
Vranken & Martiniello 1992). Today, however, only rural Portuguese are 
still migrating in large numbers. Many EU migrants return to their native 
lands. Among those who remain, many are second- and third-generation 
residents. 


3. Ex-colonial peoples form a large contingent. These groups usually arrived 
familiar with the host country's culture and language. They include Indians, 
Pakistanis, and West Indians in the United Kingdom, North Africans and 
Southeast Asians in France, Eritreans and Somalis in Italy, and Surinamers 
in the Netherlands. Distinctions are often made among these groups. In 
Britain, “new commonwealth peoples" is the euphemism for ex-colonials 
of color (Miles & Phizacklea 1984, Miles & Cleary 1992, p. 131). 

4. Recruited workers from such noncolonial countries as Turkey form a fourth 
group. Germany patterned the “guest worker" (Gastarbeiter) system after 
the Swiss treatment of Italian workers. The intention was for the recruits to 
rotate before planting family roots. But the Swiss plan involved mostly 
service workers. Skilled work required training, and companies were un- 
willing to rotate their "guests" and lose their human capital investment. 
Soon families joined the workers, and migratory chains formed. The guests 
had come to stay (Thraenhardt 1992b). 


5. Refugees and asylum seekers are an increasingly large cluster among the 
new minorities. About 15 million people throughout the world claim this 
status, though most go from one third-world country to another. While only 
5% are in western Europe (Santel 1992, p. 107), their arrivals in EU coun- 
tries rose rapidly during the 1980s—from 65,000 in 1983 to 289,000 in 
1989 (Castles 1993, p. 18). It reached a peak in 1992 with 700,000 applica- 
tions but, with tightened regulations, declined to 300,000 in 1994 (Koser 
1996, p. 153). 


6. Accepted illegal immigrants are those who, while not legal, are known to 
authorities and tolerated as long as they are economically useful. Polish 
construction workers in Germany and African harvest workers in Italy are 
two examples. These groups are vulnerable to the whims of officials and the 
economy, and they receive no social welfare benefits. In contrast, such 
prosperous illegals as the English in Portugal do not register so as to avoid 
taxes (Miles 1993). 

7. Rejected illegal immigrants are the true illegals. Since there is no perceived 
economic need for them, authorities often deport them. Organized criminal 
groups from eastern Europe and Russia are often in this group. Many gener- 
alize justifiable opposition to such groups into opposition to all immigrants. 


The fuzzy boundaries of these types overlap. Asylum seekers are a highly 
diverse group and constitute an especially slippery social construction (Castles 
1993, p. 19, Joly 1996, Koser 1996, Santel 1992). The 1951 Geneva Conven- 
tion of the United Nations defined a “political refugee" narrowly: persons with 
a "well-founded fear" of persecution in their native lands because of their race, 
religion, nationality, or political opinions. This definition excludes victims of 
generalized oppression, civil wars, or natural disasters as well as economic 
refugees. With rapid population and slow economic growth in much of the 
world, more asylum seekers try to escape poverty—not persecution as the 
United Nations defines it. 

In the 1990s, the European Parliament enunciated its “safe country of origin 
principle.” Designed to harmonize EU policies toward asylum seekers, the 
Parliament returned to the narrow UN definition to exclude many “unfounded 
applications" (European Parliament 1997). The policy has had a chilling ef- 
fect. In the Netherlands, for instance, the number of asylum seekers declined 
from 53,000 in 1994 to about 21,000 in 1996 (Muus 1996/7). 


Belongingness and Citizenship 


The new minorities often find citizenship to be a major barrier. Without the 
New World's immigration traditions, Europeans lack a “melting pot" metaphor
and a sense that immigration is “normal.” Nationality often carries biological
connotations: "British stock," as Margaret Thatcher phrased it 
(Thraenhardt 1992b, p. 16). Thus, many view the new minorities as not be- 
longing—even the growing numbers of the second- and third-generation who 
have lived only in the host nation. 

This sense of not belonging interacts with citizenship. Here the nations vary 
widely (Thraenhardt 1992b). Sweden and the Netherlands are “the most welcoming
for immigrants" (Waldrauch & Hofinger 1997, p. 278). They boast the 
highest rates of naturalization relative to their populations, and they allow voting
in local elections for immigrants before citizenship (Hammar 1993). Although
becoming more selective (Alund & Schierup 1993), Sweden provides 
courses in its language and culture, and naturalization for immigrants after five 
years. 

Britain and France, though increasingly restrictive, have allowed extensive 
naturalization for ex-colonial peoples. And most of the second generation born 
in the United Kingdom or France receive citizenship. Three nations without 
former colonial subjects—Germany, Austria, and Switzerland—are by far the 
most restrictive (Waldrauch & Hofinger 1997). Turks provide a revealing ex- 
ample. By the mid-1990s, less than 5% of resident Turks had gained citizen- 
ship in Germany compared to more than a fifth in the Netherlands. In 1995, the 
Netherlands granted the largest number of naturalizations in its history:
71,000, twice that of 1992 (Muus 1996/7). 


A Time of Threat and Change 


Western Europe has experienced dramatic economic changes during the final 
decades of this century. The oil shocks of the 1970s reminded Europeans of the 
vulnerability of their economies. As in the United States, European govern- 
ments began to give deficit reduction and global competitiveness priority over 
social and distributive justice (Stasiulis 1997). “Downsizing” the work force 
took hold, and unemployment mounted. Guest-worker programs ended, but 
the foreigners did not leave. Indeed, more immigrants arrived as families re- 
united, and the entry of refugees increased. Thus, the urban concentrations of 
the new minorities expanded. As unemployment intensified from the eco- 
nomic restructuring in the 1980s, it became easy to blame the foreigners. 

These economic phenomena took place in a context of equally sweeping 
political alterations. The power of nation-states began to erode as European 
unity advanced, while regional claims for autonomy grew. The Communist re- 
gimes in the East imploded, the Berlin Wall fell, Yugoslavia broke up into con- 
tending ethnic enclaves, and German unification came suddenly. Societal dis- 
equilibria swept central and eastern Europe. 

Many worried that a "flood" of eastern Europeans would "pour" in. Germany
introduced a new Gastarbeiter policy in 1990 involving eastern European
governments (Rudolph 1996). Germany now has about a million East 
European nationals within its borders, most of whom are Polish or from the 
former Yugoslavia (Carter et al 1993, p. 492). Overlooked in the public debate 
is that every western European nation's natural increase (births over deaths) 
has declined since 1960 while its economy has expanded—making immigra- 
tion essential for continued prosperity (Munz 1996, Thraenhardt 1996). 

Nevertheless, such events create threat. They set the scene for scapegoating 
the culturally different “others” in their midst. Quillian (1995) shows that 
group threat is important. Defined as the interaction of high non-EU minority 
percentage and low gross national product, it accounts for 70% of the variance 
in anti-immigrant attitude means across the 12 EU nations (also see Fuchs et al 
1993). 


MAJORITY PREJUDICE AGAINST THE NEW 
MINORITIES 


We can assess attitudes toward the new minorities with a rich data source. In 
1988, the Eurobarometer Survey 30 asked seven probability samples a range 
of prejudice measures about a variety of minorities (Reif & Melich 1991). In 
West Germany, the survey asked 985 majority respondents about Turks. In 
France, it asked 455 about North Africans and 475 about southeastern Asians. 
In the Netherlands, it asked 462 about Surinamers and 476 about Turks. And in 
Great Britain, it asked 471 about West Indians and 482 about Pakistanis and 
Indians (Pettigrew et al 1998, Zick 1997). 


Blatant and Subtle Prejudice 


Two key measures distinguish between blatant and subtle types of prejudice 
(Pettigrew & Meertens 1995). Blatant prejudice is the traditional form; it is 
hot, close, and direct. The ten items that tap it involve open rejection of minori- 
ties based on presumed biological differences. Subtle prejudice is the modern 
form; it is cool, distant, and indirect. The ten items that measure it are not
readily recognized as indicators of prejudice. They tap the perceived threat of the
minority to traditional values, the exaggeration of cultural differences with the 
minority, and the absence of positive feelings toward them. American
researchers have studied similar distinctions (Pettigrew 1989, Sears 1988). And,
as various writers had proposed (Barker 1982, Bergmann & Erb 1986, Essed
1990), the distinction proved equally useful in Europe.

Figure 1 Blatant and subtle prejudice across seven samples

Figure 1 shows the blatant and subtle scale means for the seven samples.
Four major findings emerge. 1. The subtle means are consistently higher than
those of the blatant scale, because the subtle items are covert and more socially
acceptable (Pettigrew & Meertens 1995, 1996). 2. The means for blatant
prejudice are conspicuously higher for German attitudes toward Turks and
French attitudes toward North Africans. This result suggests that norms 
against the open expression of prejudice are weakest in these two instances. 
The means for subtle prejudice, however, reveal less variability. 3. Target dif- 
ferences exist in two nations—less French prejudice against Asians than North 
Africans, and less Dutch prejudice against Surinamers than Turks. Note these 
preferences place greater weight on cultural than racial similarities. 4. Ob- 
serve, too, the distinctive data of the Dutch. They are significantly lower on 
blatant prejudice, but not on subtle prejudice. The contrast is striking when we 
compare similar target groups. The Dutch blatant mean for Turks is signifi- 
cantly lower than that of the Germans for Turks. And the Dutch blatant mean 
for Surinamers is significantly lower than that of the British for West Indians. 
Yet the Dutch subtle means are higher than those in these comparisons. In normative
terms, this unique pattern outlines the famed “tolerance” of the Netherlands. 
There exists a stern Dutch norm against blatant prejudice. But subtle prejudice
slips in under the norm, unrecognized as prejudice (Pettigrew & Meertens 
1996). 

Across the seven samples, the blatant and subtle prejudice scales correlate
between +.48 and +.70. The two measures share the same correlates in all samples
(Meertens & Pettigrew 1997). Both the blatantly and subtly prejudiced are
less educated and older. They report less interest in politics but more pride in
their nationality. They less often think of themselves as “Europeans” (Pettigrew
1998). They are more politically conservative; but subtle prejudice is not,
as some claim, simply a reflection of conservatism (Meertens & Pettigrew
1997). The prejudiced also are more likely to have only ingroup friends (Pettigrew
1997). Finally, they reveal a strong sense of group, but not individual,
relative deprivation. Thus, the prejudiced sense a group threat to “people like
themselves” from minorities, but not a personal threat. These correlates
replicate findings of American research. Since these extensive data involve
seven independent samples, four nations, and six target minorities, this
replication is of theoretical significance.

Figure 2 Three prejudice types and attitudes on immigrant rights


Attitudes Toward Immigration 


Do the blatant and subtle prejudice measures predict attitudes toward the sali- 
ent issue of immigration? Consider the differences among three types of re- 
spondents. Equalitarians are those who score below the central point (not the 
mean) of both the blatant and subtle scales. Bigots score above the central 
points of both scales. The Subtles are the most interesting; they score low on
blatant but high on subtle prejudice. They reject crude expressions of preju- 
dice. Still, they view the new minorities as “a people apart” who violate tradi- 
tional values and for whom they feel little sympathy or admiration. (A fourth 
logical type, those high on blatant but low on subtle prejudice, occurs in less 
than 3% of the sample.) 
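
The three-way typology described above can be sketched as a simple decision rule. The following is only an illustration under stated assumptions, not the scoring procedure used in the original analyses: the function name, the shared scale midpoint of 2.5, and the treatment of a score exactly at the midpoint as "low" are all hypothetical choices, since the text specifies only that the central point of each scale, rather than the sample mean, serves as the cutoff.

```python
from enum import Enum


class PrejudiceType(Enum):
    EQUALITARIAN = "Equalitarian"
    SUBTLE = "Subtle"
    BIGOT = "Bigot"
    RESIDUAL = "Residual"  # high blatant, low subtle; under 3% of the sample


def classify(blatant: float, subtle: float, midpoint: float = 2.5) -> PrejudiceType:
    """Assign a respondent to a type from two mean scale scores.

    Assumptions (not from the source): both scales share one midpoint,
    2.5 is that midpoint, and a score exactly at the midpoint counts as low.
    """
    high_blatant = blatant > midpoint
    high_subtle = subtle > midpoint
    if high_blatant and high_subtle:
        return PrejudiceType.BIGOT
    if high_subtle:
        return PrejudiceType.SUBTLE
    if high_blatant:
        return PrejudiceType.RESIDUAL
    return PrejudiceType.EQUALITARIAN


if __name__ == "__main__":
    # Invented scores, for illustration only.
    for scores in [(1.6, 2.1), (1.9, 3.0), (3.2, 3.4), (2.9, 2.2)]:
        print(scores, classify(*scores).value)
```

Cutting at a fixed scale midpoint rather than at each sample's mean keeps the categories comparable across samples whose average prejudice levels differ.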


Figures 2 and 3 show the results for all 3800 majority respondents, and 
these results replicate in all seven samples. In Figure 2, most Bigots wish to
restrict immigrants' rights further. Most Equalitarians favor extending
immigrants' rights. By contrast, many Subtles simply wish to leave the issue as it is.
When asked if government should make citizenship easier for immigrants, the 
three types line up as expected. While most Equalitarians think naturalization 
procedures should be easier, most Subtles and Bigots disagree.

The surveys also included a scale of immigration positions that allowed 
multiple responses. “... The government should... (1) send all Asians, even 
those born in France, back to their own country. (2) Send only those Asians 
who were not born in France back.... (3) Send only those Asians back who are 
not contributing to the economic livelihood of France. (4) Send only those 
Asians who have committed severe criminal offenses back... (5) Send only 
those Asians who have no immigration documents back... (6) The government 
should not send back to their own country any of the Asians now living in 
France.”

In Figure 3, differences between the types also appear on this measure. 
Many Bigots want to send all immigrants home. Equalitarians often favor not 
sending back any immigrants. Subtles typically support sending immigrants 
home only when there is an ostensibly nonprejudicial reason for doing so—if 
they have committed crimes or do not have their documents. These differences 
among the types are statistically significant in all samples.


Are Attitudes Toward Foreigners Becoming More Negative?


Unfortunately, the Eurobarometer surveys have not repeated the extensive 
1988 measures of prejudice. They have, however, repeatedly asked several 
relevant questions (Melich 1995). Figure 4 shows the rising percentage of 
Europeans who believe there are "too many” non-EU foreigners in their coun- 
try. For each national sample shown and the 12 EU nations (EU12) combined, 
the sharpest increase occurs between 1988 and 1991. (Preunification 1988 data 
for East Germany were not attainable.) Clear majorities in Belgium, West Ger- 
many, France, and Italy agreed during the early 1990s that the number of for- 
eigners is excessive. There was, however, less of this feeling by 1994, espe- 
cially in Germany and Italy. The decline in Germany may well reflect changes 
in the constitution that made it difficult for asylum seekers to gain entry. 

Figure 5 tells a similar story. Abrupt increases in the numbers of those who 
wish to restrict the rights of non-EU nationals again occur between 1988 and 
1991. Yet not all indicators show this effect. The percentage of respondents 
who find the presence of non-EU nationals “disturbing” does not rise much 
over these years. Hence, western European opinion toward foreigners did 
harden during the years when the issue took center stage and political leaders 
defined immigration as a serious problem. Yet the increases in negative atti- 
tudes are not so large as to explain the rise in political and violent actions 
against immigrants.

Figure 4 Too many non-European Union nationals in the country, 1988-1994

Figure 5 The rights of non-EU nationals should be restricted, 1988-1992


DISCRIMINATION AGAINST THE NEW MINORITIES 


Direct and Indirect Discrimination 


Discrimination against the new minorities is pervasive throughout western 
Europe (Castles 1984, MacEwen 1995, World Council of Churches Migration 
Secretariat 1980). But, save for the United Kingdom and the Netherlands, the 
problem has received far less attention than that of violence. 

Both direct and indirect discrimination are involved (Pettigrew & Taylor 
1991). Direct discrimination, like blatant prejudice, is straightforward. It
occurs at points where inequality is generated, often intentionally. Indirect
discrimination, like subtle prejudice, is less obvious. It involves the perpetuation
or magnification of the original injury. It occurs when the inequitable results of 
direct discrimination are used as a basis for later decisions (“past-in-present 
discrimination”) or decisions in related institutions (“side-effect
discrimination”; Feagin & Feagin 1986). Indirect discrimination, a result of systemic
patterns, is largely unrecognized in Europe.

Investigators have repeatedly uncovered direct discrimination in England 
(Amin et al 1988, Daniel 1968, Gordon & Klug 1984, Smith 1976). Controlled 
tests reveal the full litany of discriminatory forms involving employment, public
accommodations, housing, the courts, insurance, banks, even car rentals.
One study sent identical letters for 103 advertised, nonmanual jobs from native
white, West Indian, and Asian applicants. The letters contained multiple cues 
of ethnicity. Firms refused the white an interview only six times when they 
granted one to a minority candidate. But on 49 occasions, they called the white 
for an interview and refused to interview the minority candidates (Hubbuck & 
Carter 1980). 

In Germany, a reporter dyed his moustache black, dressed in guest-worker 
style, and tried to get a drink in bars and cafes throughout Frankfurt. Repeatedly,
he was refused service and thrown out (Castles 1984, p. 191). Better-controlled
field studies by social scientists reveal differential, face-to-face treatment
of the new minorities in Britain, Germany, and the Netherlands (Den Uyl
et al 1986, Klink & Wagner 1998, Sissons 1981). Their results closely resemble
those of similar field studies of discriminatory acts in the United States
(Crosby et al 1980). 

Employment discrimination poses the most serious problem. In every western
European nation, foreigners have far higher unemployment rates than do
natives. In 1990 in the Netherlands, Moroccans and Turks had unemployment
rates above 40%, and the Surinamese 27% compared with the native Dutch
rate of 13% (Pettigrew & Meertens 1996). During the 1974-1977 recession,
West German manufacturing reduced its labor force by 765,000, 42% of
whom were foreign workers (Castles 1984, p. 148).

There are many reasons in addition to discrimination for this situation. The 
“last-in, first-out” principle selectively affects the younger foreign workers. 
Being typically less skilled, they are more affected by job upgrading. Foreign- 
ers also are more likely to be in older, declining industries in such areas as the 
Ruhr Valley. Indeed, planners put Gastarbeiter into these industries for 
cheaper labor precisely because of their decline. Some emphasize these factors 
to deny any role for discrimination. Yet these factors offer insufficient expla- 
nations for the greater unemployment of foreigners. Veenman & Roelandt 
(1990) tested how much education, age, sex, region, and employment level ac- 
count for the large group discrepancies in Dutch unemployment rates. They 
found that these factors explained only a small portion of the differential rates. 

Indirect discrimination operates when the inability to obtain citizenship re- 
stricts the opportunities of non-EU minorities in most institutions. It restricts 
their ability to get suitable housing, employment, and schooling for children. A 
visa is required for travel to other EU countries. In short, the lives of nonciti- 
zens are severely circumscribed (Wilpert 1993). 

Castles (1984) contends that the guest-worker system was itself a state- 
controlled system of institutional discrimination. It established the newcomers 
as a problematic and stigmatized outgroup, suitable for low-status jobs but not 
for citizenship. For initial victims of such direct discrimination, indirect
discrimination in all areas of life was inevitable.

Wilpert goes further. She asserts that Germany's institutions are based on
"a dominant ideology which distributes rights according to ethnic origins..." 
(Wilpert 1993, p. 70). The revealing comparison is between the almost two 
million Aussiedler and the Gastarbeiter. Officials regard the former as kin, often
on the thinnest of evidence, though since 1996 a language test must be
taken. Aussiedler readily become citizens and receive favorable government
treatment. Yet even third-generation Turks, who are at least as culturally “German”
as the Aussiedler, are largely denied citizenship and given unfavorable
treatment. 


Anti-Discrimination Remediation 


Basic rights in Germany are guaranteed only to citizens. So, the disadvan- 
tages of noncitizenship include limited means to combat discrimination 
(Layton-Henry & Wilpert 1994). Extensive German legislation combats anti- 
Semitism and Nazi ideology, but these laws have proved difficult to apply to 
noncitizens. 

The German constitution explicitly forbids discrimination on the basis of 
origin, race, language, beliefs, or religion, but not citizenship. Indeed, the
Federal Constitutional Court (Bundesverfassungsgericht) has ruled that differ- 
ential treatment based on citizenship is constitutional if there is a reasonable 
basis for it, and if it is not wholly arbitrary. In practice, this has meant a court
has upheld charging foreign bar owners higher taxes than German bar owners. 
And restaurants can refuse service to Turks and others on the grounds that their 
entry might lead to intergroup disturbances. According to the German legal 
specialist Dan Leskien, Germany needs anti-discrimination legislation with 
broad enforcement powers and an effective monitoring system (Layton-Henry 
& Wilpert 1994, pp. 19-22).

Few means of combating discrimination are available in France either. 
Commentators often view discrimination as "natural," as something univer- 
sally triggered when a "threshold of tolerance" (seuil de tolerance) is sur- 
passed (MacMaster 1991). Without supporting evidence, this rationalization 
supports quotas and dispersal policies that limit minority access to suitable 
housing. 

The Netherlands, United Kingdom, and Sweden have enacted anti-discrimination
legislation that specifically applies to the new minorities. Not
coincidentally, these countries make citizenship much easier to obtain than
Germany does. Yet this legislation has been largely ineffective for two interrelated
reasons. First, European legal systems do not allow class action suits—a force- 
ful North American weapon to combat discrimination. Second, European ef- 
forts rely heavily on individual complaints rather than systemic remediation. 
Britain's 1976 Act gave the Commission for Racial Equality power to cast a
broad net, but individual complaints remain the chief tool (MacEwen 1993,
1995). 

It is a sociological truism that individual efforts are unlikely to alter such 
systemic phenomena as discrimination. Mayhew (1968) showed how individ- 
ual suits and complaints are largely nonstrategic. Minorities bring few charges 
against the worst discriminators, because they avoid applying to them. Com- 
plaints about job promotion are common, but they are made against employers 
who hire minorities. Thus, effective anti-discrimination laws must provide 
broad powers to an enforcement agency to initiate strategic, institutionwide 
actions that uproot the structural foundations of discrimination. 


POLITICAL RESPONSES


The Rise of Far-Right Anti-Immigration Parties


By the 1980s, the new Europeans elicited an increasingly hostile reaction from 
sectors of the native populations that felt especially threatened. Throughout 
western Europe, extreme right-wing groups seized on the threat as their central 
issue. In each election for the European Parliament, the average vote for these 
anti-immigration parties has risen: 3.4% in 1979, 4.9% in 1984, and 5.1% in
1989. By 1994, with an average of 11.1% unemployment in the European Un- 
ion, the far-right parties garnered 6.9% of the vote and 25 seats (3 from Bel- 
gium, 11 each from France and Italy). Indeed, a close relationship is found 
across countries between the extreme right's share of the European Parliament 
votes and unemployment (Baimbridge et al 1994, 1995). 

It is a mistake to view the new European right as simply Nazi revivals. To 
be sure, they share the classic authoritarian personality orientation—calls for a 
strong leader and law and order, beliefs in conspiracy theories, and an exclu- 
sionary view of citizenship. But, as Kitschelt (1995) makes clear, times have 
changed and so has the radical right. He distinguishes four types of far right. 
Two have only tiny constituencies—traditional fascists and “welfare chauvin- 
ists." Two other types, however, have attracted strong followings—the new 
radical right and “populist anti-statists." Unlike the first two varieties, these 
right-wing movements heartily endorse free market capitalism. They are po- 
litical, not economic, populists. And the anti-statists gain ground in Italy and 
Austria, where patronage is based on the traditional party system. Yet one cen- 
tral theme runs through all four types—nativism and stern opposition to immi- 
gration. 


AUSTRIA  Begun after World War II under a former member of the Nazi SS, 
Austria's Freedom Party (FPO) met with only modest success until Joerg
Haider became its leader in 1986. A multimillionaire with a populist flair,
Haider fashioned the FPO into the strongest far-right party in western Europe
and a major party of Austria. He gained international notoriety when, as gover- 
nor of Carinthia province, he praised Nazi labor practices as a good way to re- 
duce the welfare rolls (Feen 1996, Wise 1995). Not surprisingly, Haider and 
his party vigorously oppose immigration, bilingual education, and immi- 
grants' rights. 

In 1994, Haider's Freedom Party won 42 of 183 parliamentary seats with 
more than 22% of the vote. In 1996, the party won 29 seats in Vienna's city
council with the anti-immigration slogan, “Vienna must not become like Chicago”
(Shanker 1996). Despite Haider’s ambiguous remarks about Nazis, he
claims the party is leaving its Nazi roots. Indeed, the FPO has strong free mar- 
ket and anti-statist positions (Betz 1994, Kitschelt 1995, Parkinson 1989). It 
offers a classic case of Kitschelt's (1995) anti-statist type. Nonetheless, violent 
groups in Austria, such as the one that desecrated a new Jewish cemetery in 
November 1992, use Nazi symbols and proclaim support for Haider and his 
party (Husbands 1993, p. 113). 


BELGIUM Until 1979, the Volksunie was the only nationalist party of Flan- 
ders. Then dissidents formed the Vlaams Blok (VB)—a prime example of 
Kitschelt's “new radical right" type. This party stands for a separate Flanders 
that would someday join the Netherlands, and it opposes non-European resi- 
dents. It would set up a fund for their repatriation, expel them after three 
months of unemployment, prohibit family reunion, and levy a tax on their employers.
With this program, the VB has increased its vote in every general election
since 1981. Centered in Antwerp, it now attracts about one eighth of
Belgium's Flemish voters (Govaert 1995).


FRANCE The best known of Europe's far-right parties is Jean-Marie Le Pen's 
Front National (FN)—another example of "the new radical right." Formed in 
1972, the FN suffered repeated electoral reverses until the 1980s. It gained
respectability in 1983 when conservatives joined it in a second ballot in the
small city of Dreux. From 1984 to the present, the party has consistently at- 
tracted between 9% and 16% of the vote in a variety of national elections 
(Singer 1991; Husbands 1991b, Table 3). In 1995, for example, Le Pen re- 
ceived more than four million votes for president—more than 15% of the total. 

The FN is a nationalist and populist party that has carved out a stable slot in 
French politics (Birenbaum 1992, Fysh & Wolfreys 1992, Husbands 1991a, 
Marcus 1995, Mayer & Perrineau 1992, Shields 1989, Tezenas du Montcel 
1995). It is especially strong in cities of the Southeast and Northeast—areas hit 
hardest by industrial decline and “the exhaustion of the working-class
movement” (Wieviorka 1993, p. 64). Yet it also has secured a modest hold
throughout the country.

It has a broad policy program, but its key position is anti-immigration. Its
leader, Le Pen, expresses blatant bigotry against a range of immigrants and
minorities, from Jews to the Harkis who fought with the French against
Algerian independence. “Two million unemployed” he asserts, “that's two million
immigrants too many” (Gunn 1989, p. 23). Increasingly, the French
understand Le Pen's position. National surveys show 53% in 1996, compared with
43% in 1990, “understand the Le Pen vote” considering “the behavior of
certain immigrants” (Commission Nationale Consultative des Droits de l'Homme
1997, p. 371). 

Given its broad base, the FN's voters do not differ from the general French 
electorate as much as some writers suggest (Husbands 1991b). There are, however,
consistent findings in the many studies of the phenomenon. Like supporters
of other radical right parties, FN voters are typically males (women are
more attracted to European Green parties; Betz 1994, p. 143). And they are 
particularly numerous among small businessmen and craftsmen as well as 
white ex-colonials. Since 1984, the FN has attracted an increasing percentage 
of manual workers. Yet most of its gains are at the expense of other right-wing
parties, not the declining French Communists, as some claim. And those
French who live in communities with more than 10% foreign populations are 
less prejudiced and more accepting of immigrants than are other French (Com- 
mission Nationale Consultative des Droits de l'Homme 1997, p. 388). There 
also is a vigorous anti-FN countermovement (Mayer 1995), fueled by students 
(Husbands 1991b). 


GERMANY Le Pen’s success inspired the German right. In the 1986 Bavarian 
state elections, the Christian Social Union (CSU) began a furious campaign 
against third-world refugees. Their electoral success put the issue on center 
stage. 

This event also saw a new far-right party split from the CSU. With anti-immigration
its chief issue, the Republikaners under their leader, Franz
Schoenhuber, a former SS member, offered a populist-nationalist alternative
similar to that of the Front National. While centered in Bavaria, the Republikaners
enjoyed success in elections elsewhere from 1989 to 1993. In 1989,
they won 90,000 votes and 11 parliament seats in West Berlin. And they gar- 
nered more than two million votes and six seats in a European Parliament elec- 
tion. They also did well in state elections in Baden Wuerttemberg in 1992 and 
Hessen in 1993. However, the Kohl government's partial adoption of their pro- 
gram blunted their appeal (Atkinson 1993, Thraenhardt 1992b, Wilpert 1993). 
By the 1994 European Parliament election, their strength had dissipated. 


GREAT BRITAIN From the 1950s till the late 1960s, a policy consensus on
immigration existed between the Labor and Conservative parties. It depoliticized
race, allowed limited nonwhite immigration, but did little to improve the status
of immigrants (Messina 1989, Rich 1990). The crowning achievement of this 
era—the 1968 Race Relations Act—lacked the necessary enforcement powers 
to be effective. North American specialists (including the writer) had warned 
the act's drafters of the need for structural teeth, but their advice was ignored.

Ending this cozy arrangement, Enoch Powell began an anti-minority cam- 
paign in 1968 (Schoen 1977). A maverick Conservative member of Parlia- 
ment, Powell predicted “rivers of blood” if nonwhites continued to come to 
Britain. He opposed anti-discrimination legislation and called for immigration 
restrictions and nonwhite repatriation. His message struck a popular chord in 
British opinion. So popular, in fact, it broke the consensus and molded the 
Conservatives into an anti-immigration party. “Almost single-handedly,”
writes Messina (1989, p. 105), “Powell prepared the intellectual groundwork 
for the emergence of Margaret Thatcher as Conservative party leader in 1975.” 

In 1967, England’s National Front mobilized far-right opposition to the 
new minorities. It had minor success, especially in the 1977 elections of the 
Greater London Council. Yet, as in Germany, the government assumed much 
of the Front’s position. Thatcher won election in 1979, aided by her anti- 
immigration stand. She slashed the budget of the Commission for Racial 
Equality and pushed through revisions of the immigration rules designed to 
end primary nonwhite entry. 


THE NETHERLANDS Unlike Britain, the major Dutch parties have maintained
an enlightened consensus throughout this period. The focus of this consensus, 
however, has shifted. In the 1980s, it emphasized the collective integration of 
minorities. Now it stresses the integration of minority individuals into the la- 
bor market (Fermin 1997). Still, a misnamed Centrum Party formed in 1980 to 
exploit the immigration issue. Two years later, it secured one seat in the lower 
house of parliament (Tweede Kamer). In 1986, the party split into two—the 
Centrum Party 86 and the Centrum Democrats. Only the latter has secured 
seats in the Tweede Kamer: one in 1989, three in 1994. Some members sit on
municipal councils (van Donselaar 1993). Yet, with only modest strength in a 
few urban pockets, the Dutch far-right has failed to crack the dominant consen- 
sus on the new minorities. 


SWITZERLAND Economic insecurity is a facilitating factor in far-right opposition
to immigration. Yet prosperous Switzerland shows that it is not a necessary
factor. An extremist alliance, National Action Against the Swamping of
the People and Homeland, came forward with a radical initiative in 1970. It 
proposed to cut the foreign population almost in half, and the initiative won the 
support of 46% of Swiss voters. The alliance later split into two small parties, 
both with parliamentary representation. They have kept immigration controversial,
and gained power in Geneva as vigilants (Thraenhardt 1992b, pp.
42-44). 


SOUTHERN EUROPE For Greece, Italy, Portugal, and Spain, large-scale immi- 
gration is a new phenomenon. Long accustomed to emigration, these nations 
have been slow to adjust to their new situation. Italy has attracted the most im- 
migrants, many of them illegals. Under EU pressure, Italy belatedly began to 
formulate immigration policies in 1986 (Campani 1993, Martiniello 1992b). 
Its mishandling of Albanian refugees in 1991, however, revealed how Italy re- 
mains unprepared to be an immigrant receiving country (Vasta 1993). 

Figures 4 and 5 record the pointed rise in Italian concern over immigration 
after 1988. This suggests that a sudden increase in foreigners, rather than the 
actual proportion of foreigners, is key to predicting change in European attitudes
toward immigration. In Italy, both national and regional political parties
have exploited the public's changing attitudes. At the national level, a neo- 
fascist party (Movimento Sociale Italiano) and the Partito Repubblicano Itali- 
ano have taken anti-immigration positions. And so have regional secessionist 
parties, especially the Leagues of Tuscany and Lombardy (Martiniello 1992b, 
Vasta 1993). Indeed, a blatantly racist platform helped the Lega Lombarda 
gain success in the local elections of May 1990. More recently, the Lega Nord 
and the Alleanza Nazionale have added to anti-immigration agitation. 


The Thraenhardt Thesis 


Thraenhardt (1995) contends that these political phenomena are similar across
France, Germany, and the United Kingdom. While far-right efforts have 
gained only minimal power directly, they have shifted the entire political spec- 
trum to the right on immigration. Left-wing and center politicians have 
equivocated, sometimes even collaborated. Conservative politicians have ex- 
ploited the situation for two reasons. First and foremost, they see an opportu- 
nity to obscure economic issues and seize a share of the left's labor vote. And, 
second, they fear the loss of supporters to the far right. Thraenhardt (1992b, p. 
49) credits racist appeals as vital to conservative victories in all three nations. 
Conservative governments have made repeated concessions to anti- 
minority sentiments. They have “played an important role in promoting xeno- 
phobia and putting it on the public agenda" (Thraenhardt 1995, p. 337). Former 
Prime Ministers Thatcher and Major in the United Kingdom, Chancellor Kohl 
in Germany, and President Chirac in France have all espoused restrictions on 
immigration and citizenship that partly meet the far right's demands. Note 
these similarities across the three nations emerged despite sharp differences in 
their immigrant populations. Note also the policy inconsistency: Conservative 
parties actively pursued immigration to provide cheap labor for industry; now 
they stigmatize and scapegoat the foreigners whom they earlier had invited.

The Thraenhardt thesis applies beyond Europe. Recall 1968 in the United
States when Alabama Governor George Wallace helped to create a climate that 
moved President Nixon to the right on civil rights. The Republican Party has 
played "the race card" ever since, and converted the white South into its major 
base of support (Carmines & Stimson 1989, Kinder & Sanders 1996). Simi- 
larly in Australia, the rise of a far-right, populist politician, Pauline Hanson, 
has pushed Prime Minister John Howard's Liberal Government further to the 
right on racial and immigration issues. 


VIOLENCE AGAINST THE NEW MINORITIES 


Increasing Anti-Immigrant Violence 


Playing "the race card" heightens intergroup tensions and risks violence. In- 
deed, Europe's political shift to the right accompanied a rise in anti-minority 
violence (Bjorgo & Witte 1993, Koopmans 1995, Witte 1995). In 1990, vio- 
lent attacks against African street vendors in Florence, Italy, and the desecra- 
tion of Jewish graves in Carpentras, France, made headlines. 

The most publicized attacks occurred in Germany (Atkinson 1993, Heit- 
meyer 1993). In September 1991, a mob attacked and besieged a residence of 
asylum seekers in Hoyerswerda. Soon imitative acts of brutality erupted, the 
worst being riots and murders in Rostock, Moelln, Solingen, and Magdeburg. 
Passive onlookers and ineffective police characterized these horrendous 
events. Initially, Asian and African refugees were the primary targets. Later, 
Turks also became victims (Wagner & Zick 1997). Moreover, the intensity of 
the political debates on the constitutional rights of asylum seekers closely par- 
alleled these acts of extreme-right violence (Gerhard 1992, Koopmans 1995, 
p. 27, Zick & Wagner 1993). 

Germany was not alone. Britain (Gordon 1993), France (Lloyd 1993), the 
Netherlands (van Donselaar 1993), Scandinavia (Bjorgo 1993, Loow 1993), 
and the rest of Europe have all experienced patterned, anti-minority violence. 
Differences in record keeping and definitions of violence preclude precise 
cross-national comparisons (Koopmans 1995, Witte 1995). Nonetheless, 
sharp differences in racist violence exist. Per million inhabitants over the 
1988–1993 period, England and Wales have had as many or even more racist 
acts as Germany. France, Norway, and Denmark have far lower rates. Switzer- 
land has a high rate of deaths due to racist and extreme-right violence (Koop- 
mans 1995, pp. 9-14). Save for Switzerland, however, the lethality of this 
European violence has not rivaled that of sectarian violence in Northern Ire- 
land or of the Ku Klux Klan in the late nineteenth century in the southern 
United States. 

The far-right does not commit all the racist violence. Some youthful perpe- 
trators evince little or no right-wing ideology (Willems 1995). Their violence 
often involves the affect-arousing context of sports (Holland 1995). And not 
all right-wing targets are minorities. Still, in Germany and the Netherlands in 
the early 1990s, low-status minorities were the targets of roughly three fourths 
of far-right violence (Buijs & van Donselaar 1994, pp. 69–70). Significantly, 
other immigrants (from the EU, Japan, or North America) were rarely vic- 
tims (Witte 1995, p. 494). 


The Koopmans Thesis 


Using social movement theory, Koopmans (1995) offers a two-part explana- 
tion for the sharp differences in racist violence among European nations. Fol- 
lowing from Thraenhardt's thesis, he first emphasizes the significance of po- 
litical elites who legitimize the far-right's view of foreigners as unbearable 
burdens. Thus, respected leaders convert the new minorities into problems. 
Such legitimization, Koopmans (1995, p. 34) argues, furthers far-right mobili- 
zation “... with high mass media resonance and favorable chances of substan- 
tive success.” 

This mobilization, however, need not invoke violence. It also can activate 
far-right political parties, as shown by the Vlaams Blok and Front National. 
Hence, Koopmans' second point highlights the importance of such parties. He 
shows that countries with influential racist parties, such as France, Denmark, 
and Norway, have experienced relatively low levels of racist violence. By con- 
trast, countries with weak racist parties, such as Germany and the United King- 
dom, have had high levels. Even the Netherlands and Sweden, with low levels 
of general violence and without strong racist parties, have endured mid-levels 
of such racist attacks. 

This second part of Koopmans' thesis is problematic. Aggregate data from 
only a few nations provide the quantitative support. And Europe's experience 
between the world wars contradicts the argument. Fascist parties and political 
violence developed together during those turbulent years. Two divergent theo- 
ries of human aggression are at issue. Koopmans is following Freud's (1930) 
steam-boiler model of a finite amount of aggression. If it can be channeled into 
political action, then right-wing aggression against minorities should decline. 
Allport's (1954) feedback model of aggression predicts precisely the opposite. 
Have the far-right mobilize anti-immigrant feelings, and aggression will in- 
crease and spill over into more violence against the new minorities. 

Americans can readily apply the first part of Koopmans' thesis. The regres- 
sive 1980s and 1990s have seen the erosion in the United States of the sense of 
inevitability of continued racial progress. Repeated attacks by leading public 
officials on civil rights, affirmative action, and immigration have produced an 
intergroup climate comparable to that shaped by Thatcher, Kohl, and Chirac in 
Europe (Kinder & Sanders 1996). And violence against minorities, especially 
on college campuses, appears to have risen over these years (US Commission 
on Civil Rights 1990). Koopmans' emphasis on the role of political leaders in 
violence is also consistent with recent time-series analyses by Green and his 
colleagues (Green et al 1997) of ethnic hate crimes in New York City. They 
find little relationship between these crimes and such macroeconomic condi- 
tions as unemployment rates. 


CONCLUSIONS 


This chapter emphasizes the negative reactions to western Europe's new mi- 
norities. There also is a positive side to the picture. Native populations are 
slowly adjusting to the new cultures. Parisians have developed a taste for hum- 
mus, Berliners for kabab; the Indonesian rijsttafel is a basic of Dutch cuisine. 

More importantly, western Europe now boasts anti-racist movements. 
When the wave of atrocities against foreigners swept Germany in 1992, hun- 
dreds of thousands of Germans protested. With torchlight vigils and candle- 
light demonstrations, they countered “hatred and violence” in Berlin, Ham- 
burg, Munich, and other cities and towns. When the French government tried 
to discontinue the naturalization of French-born children of foreign parents, 
students protested with the slogan, “Don't touch my buddy.” 

Still, increased prejudice, direct and indirect discrimination, political oppo- 
sition, and extensive violence are major European reactions to the new minori- 
ties. These responses represent serious social problems worthy of study for 
practical, policy reasons. In addition, attention to these problems broadens our 
understanding of intergroup conflict in industrial societies. 

The research to date reveals remarkable agreement across societies. Despite 
sharp differences in national histories, political systems, and minorities, this 
new work reveals considerable consistency across the nations of western 
Europe. It also largely replicates and extends, rather than rebuts, the North 
American literature. 

This chapter has noted many such convergencies. Blatant and subtle preju- 
dice measures scale in nearly identical ways across four nations and diverse 
minority targets. The scales also share the same correlates across the seven 
samples, and these correlates replicate North American research. Both types of 
prejudice also predict attitudes toward immigration in all samples. And 
throughout the EU, attitudes toward immigrants hardened during the tense 
1989–1992 period. 

Moreover, a host of established social psychological processes, such as in- 
tergroup contact and group relative deprivation, operate in comparable ways in 
Europe. They typically act as proximal causes of prejudice, serving as media- 
tors for the distal effects of cultural and structural factors (Pettigrew et al 
1998). 

The comparabilities extend to discrimination. Examples of both direct and 
indirect forms abound throughout western Europe. And efforts to combat dis- 
crimination have been weak across the continent. In those countries that resist 
granting citizenship to their new minorities, efficacious remediation of dis- 
crimination is extremely difficult. In those countries that have legislated 
against discrimination, the reliance upon individual complaints limits their ef- 
fectiveness—again comparable to the North American experience. 

Western European nations have seen the rise of far-right political opposi- 
tion to immigration and the new minorities. While none of these parties has 
risen to power, this right-wing surge has succeeded in moving the entire politi- 
cal spectrum to the right on the issue. The process closely resembles that of the 
Wallace movement in the United States and the current Hanson movement in 
Australia. 

Finally, similar patterns of racist violence have swept western Europe. 
While there is variability across nations in the number of reported incidents, 
the timing of this violence is similar. In particular, the elite framing of the im- 
migration discourse, especially defining the new minorities as unbearable bur- 
dens, relates closely to the violence patterns. 

The new European research supports and broadens earlier North American 
research in intergroup relations. It also extends our understanding in important 
ways. The chapter has described two of these extensions: the Thraenhardt the- 
sis on the political exploitation of xenophobia and the Koopmans thesis on the 
mobilization of racist violence. 

ABSTRACT 


Despite substantial work in a variety of disciplines, substantive areas, and 
geographical contexts, social memory studies is a nonparadigmatic, transdis- 
ciplinary, centerless enterprise. To remedy this relative disorganization, we 
(re-)construct out of the diversity of work addressing social memory a useful 
tradition, range of working definitions, and basis for future work. We trace 
lineages of the enterprise, review basic definitional disputes, outline a his- 
torical approach, and review sociological theories concerning the statics and 
dynamics of social memory. 


Introduction 


...the time is past in which time did not matter. Modern man no longer works 
at what cannot be abbreviated. 
Paul Valéry 


Scholars have viewed social memory narrowly as a subfield of the sociology of 
knowledge (Swidler & Arditi 1994) and broadly as “the connective structure 
of societies" (Assmann 1992, p. 293). They have seen it as involving particular 
sets of practices like commemoration and monument building and general 
forms like tradition, myth, or identity. They have approached it from sociol- 
ogy, history, literary criticism, anthropology, psychology, art history, and po- 
litical science, among other disciplines. They have studied it in simple and 
complex societies, from above and below, across the geographical spectrum. 
Social memory studies is nevertheless, or perhaps as a result, a nonparadig- 
matic, transdisciplinary, centerless enterprise. While this relative disorganiza- 
tion has been productive, it now seems possible to draw together some of these 
dispersed insights. Our goal in this essay is therefore to (re-)construct out of 
the diversity of work addressing social memory a useful tradition, range of 
working definitions, and basis for future work in a field that ironically has little 
organized memory of its own. 


Lineages 


Memory, of course, has been a major preoccupation for social thinkers since 
the Greeks. Yet it was not until the late nineteenth and early twentieth centu- 
ries that a distinctively social perspective on memory became prominent. The 
first explicit use of the term collective memory we could find was by Hugo von 
Hofmannsthal in 1902, who referred to “the dammed up force of our mysteri- 
ous ancestors within us” and “piled up layers of accumulated collective mem- 
ory” (Schieder 1978, p. 2). Contemporary usages are usually traced to Maurice 
Halbwachs, who published his landmark Social Frameworks of Memory in 
1925. Halbwachs’ Strasbourg colleague, historian Marc Bloch (1925), also 
used the term collective memory in 1925 as well as later in his book on feudal 
society (Bloch 1974 [1939]). The art historian Aby Warburg used the term so- 
cial memory to analyze artworks as repositories of history. Walter Benjamin as 
well, though he never used the terms social or collective memory, analyzed the 
material world as accumulated history, brilliantly emphasizing not only the 
manifold traces of the past in the artifacts of commodity culture, but the rela- 
tions between commodity culture and particular forms of historicity as well 
(Buck-Morss 1989). 

Bartlett (1932) is usually credited as the first modern psychologist to attend 
to the social dimensions of memory, attributing decisive importance to group 
dynamics in individual remembering. Anthropologist Evans-Pritchard (1940) 
developed a notion of “structural amnesia” in his famous study of the Nuer. In- 
teresting but largely forgotten works in other fields include Janet’s (1927) 
study of the evolution of memory and the concept of time, Vygotsky’s 1929 
claim that memory takes narrative form and is wholly shaped by cultural influ- 
ences (Bakhurst 1990), and Czarnowski’s 1919 Durkheimian analysis of festi- 
vals and rituals celebrating Saint Patrick (Schwartz 1996, pp. 275–76). 

In about the same period, American sociologists Cooley (1918) and Mead 
(1959 [1932]) also theorized about the social context of remembering, but their 
important ideas—especially Mead's—have usually been ascribed to extraso- 
ciological interests (Maines et al 1983). Among the emerging European clas- 
sical theorists, Durkheim (1951 [1915]) is insightful about temporality but 
addresses memory directly only in his brief discussion of commemorative 
rituals, and there only as a feature of primitive societies. Social reproduction 
is perhaps the central category of Marx's thought, but the Marxist tradition 
emphasizes the automatic and unconscious quality of the process; conscious 
attention to the past is characterized as an irrational residue of earlier social 
forms: “The tradition of the dead generations,” Marx (1852) writes in The 
Eighteenth Brumaire, “weighs like a nightmare on the minds of the living.” 
Simmel (1959) wrote that “All the uncertainties of change in time and the 
tragedy of loss associated with the past find in the ruin a coherent and unified 
expression.” This remark is prescient of later theories that see memory traces 
as evidence of loss, but Simmel did not develop it more than aphoristically. 
Weber, too, had little to say about memory, despite his interest in traditional 
legitimation: “by its very ‘progressiveness’ [civilized society]... gives death 
the imprint of meaninglessness” (1946). Meaningful death is elusive because 
memory is inadequate to hold together the diversity of our life experiences. 
But this is an intriguing aside rather than the beginning of a theory of memory. 

Shils (1981, p. 9) explains this shared neglect of tradition and memory by 
demonstrating how Weber and his contemporaries were the victims of their 
own overdrawn dichotomies. The classical theorists, Shils writes, “oversub- 
scribed to the naive view that modern society was on the road to traditionless- 
ness...." From such a perspective, an interest in how the past works on the 
present was antiquarian, or at least useful only as a contrast to the ways modern 
societies work. In his discussion of Tönnies, Terdiman (1993) notes how un- 
usual this lack of interest in memory and tradition was in a fin de siècle culture 
he describes as otherwise obsessed with memory. 

Between this early period of scattered work on the social foundations of 
memory and the present, relatively little attention was paid to the issue. Even 
major works like Lloyd Warner's The Living and the Dead (1959) were con- 
sidered exotic. Since about 1980, however, both the public and academia have 
become saturated with references to social or collective memory. Why has 
public interest in memory grown so in the last two decades? Kammen (1995) 
explains it in terms of the rise of multiculturalism, the fall of Communism, and 
a politics of victimization and regret, among other factors. Schwartz (1997) 
explains a decline in presidential reputations under the rubric of postmoder- 
nity. Nora, Hutton, Le Goff, Matsuda, and Huyssen pursue similar lines of ex- 
planation through an enterprise they label “the history of memory,” which we 
review in greater detail below. 

It is a slightly different matter to trace the rise of scholarly interest in the 
memory problematic and the associated rediscovery of Halbwachs; analytical 
paradigms appear to have at least a semiautonomous dynamic. Schwartz 
(1996) identifies three related aspects of 1960s–1970s intellectual culture that 
gave rise to interest in the social construction of the past. First, multicultural- 
ists identify historiography as a source of cultural domination and challenge 
dominant historical narratives in the name of repressed groups. Second, post- 
modernists attack the conceptual underpinnings of linear historicity, truth, and 
identity, thereby raising interest in the relations linking history, memory, and 
power. Finally, hegemony theorists provide a class-based account of the poli- 
tics of memory, highlighting memory contestation, popular memory, and the 
instrumentalization of the past. 

Hutton (1993) traces the memory problematic to the history of mentalities 
that has dominated French historiography since the 1960s. Foucault’s “ar- 
chaeological" stance provided general philosophical support for a desacraliza- 
tion of traditions. Historians like Ariés (1974) and Agulhon (1981), Hutton 
writes, began to study the history of commemorative practices, which they saw 
as mechanisms of political power, thus shifting historiographical interest from 
ideology to imagery and from meaning to manipulation. Writers like Hobs- 
bawm (whose much-cited Invention of Tradition was a hallmark work in this 
vein) extended this desacralization, seeing traditions as disingenuous efforts 
to secure political power. According to Hutton, it was on this foundation that 
interest in Halbwachs revived; his apparently presentist position was seen as 
anticipating postmodernism. The recent effort by Nora to document all the 
"realms of memory" in French society (discussed below), Hutton argues, is the 
crowning moment in this tradition. 

Analogously, sociology has moved from the study of social structures and 
normative systems to that of "practice" (Bourdieu 1984, Ortner 1984), expand- 
ing the functionalist definition of culture as norms, values, and attitudes to cul- 
ture as the constitutive symbolic dimension of all social processes (Crane 
1994). The view that all meaning frameworks have histories and that explicitly 
past-oriented meaning frameworks are prominent modes of legitimation and 
explanation leads to increased interest in social memory because it raises ques- 
tions about the transmission, preservation, and alteration of these frameworks 
over time. Social memory studies also draw on the Mannheimian tradition in 
the sociology of knowledge and the Mertonian tradition in the sociology of sci- 
ence as well as on Berger & Luckmann's (1966) social constructionism, for 
which many sociologists of memory seem to have a special affinity. Social 
memory studies thus fit squarely within the reorientation of cultural sociology, 
much like that of recent historiography, from interest in “ideas developed by 
knowledge specialists... [to] structures of knowledge or consciousness that 
shape the thinking of laypersons” (Swidler & Arditi 1994) as well as drawing 
on older sociological interests. 


Delimiting the Field 


Through this reconstruction of intellectual lineages for social memory studies, 
it is possible to limn a conceptual core for our contemporary efforts. The place 
to begin is Durkheim's response to philosophical positions, in contradistinc- 
tion to which he demanded a social account of temporality. For Halbwachs, 
Durkheim's student, this meant that studying memory was not a matter of re- 
flecting philosophically on inherent properties of the subjective mind; memory 
is a matter of how minds work together in society, how their operations are not 
simply mediated but are structured by social arrangements: "[I]t is in society 
that people normally acquire their memories. It is also in society that they re- 
call, recognize, and localize their memories...” (Halbwachs 1992, p. 38). 
Nonetheless, because questions of social memory involve issues of temporal- 
ity, mind, and, as we see shortly, narrative and historicity, social studies of 
memory have remained close to philosophy. 

Halbwachs developed his concept of collective memory not only beyond 
philosophy but against psychology, though the very idea of a social memory 
appropriates psychological terminology. Freud had argued that the individ- 
ual’s unconscious acts as a repository for all past experiences. Forgetting, 
rather than remembering, is what takes work in the form of repression and the 
substitution of “screen” memories that block access to more disturbing ones. 
Halbwachs rejects this Freudian and other purely psychological accounts. He 
argues that it is impossible for individuals to remember in any coherent and 
persistent fashion outside of their group contexts: “There is [thus] no point,” he 
argues, “in seeking where... [memories] are preserved in my brain or in some 
nook of my mind to which I alone have access: for they are recalled by me ex- 
ternally, and the groups of which I am a part at any time give me the means to 
reconstruct them...” (Halbwachs 1992, p. 38). 

Writers in other traditions have rejected an individual-psychological ap- 
proach to memory as well: Gadamer (1979), for instance, has written, “It is 
time to rescue the phenomenon of memory from being regarded as a psycho- 
logical faculty and to see it as an essential element of the finite historical being 
of man” (Hutton 1993). Contemporary psychologists Middleton & Edwards 
(1990) as well encourage their discipline to recover Bartlett’s and Halbwachs’ 
more social insights. Neisser (1982) implicitly calls for a more social perspec- 
tive on memory when he argues that the standard experimental methods of 
cognitive psychology have been inadequate due to the artificiality of the ex- 
perimental setting. Pennebaker, Paez, and Rimé (1997) take an explicitly so- 
cial psychological perspective in their studies of collective memory of politi- 
cal events. Preserving more of the individualist perspective, some authors 
have suggested possible benefits of linking social, neuropsychological, and 
paleoanthropological inquiries into memory (Schachter 1995, Leroi-Gourhan 
1993). 

The third, and perhaps most contested, boundary for social memory studies 
is its relation to historiography. Halbwachs was very decisive about his solu- 
tion: History is dead memory, a way of preserving pasts to which we no longer 
have an "organic" experiential relation. On the surface, this understanding of 
the distinction negates the self-image of historiography as the more important 
or appropriate attitude toward the past: History's epistemological claim is de- 
valued in favor of memory's meaningfulness. At a deeper level, however, the 
distinction is the same that traditional historians would draw between history 
and memory: Only the former is engaged in a search for truth. In this vein, Ye- 
rushalmi (1982, p. 95) draws a sharp contrast between Jewish memory and 
Jewish historiography, arguing that until the eighteenth century, the former ex- 
cluded the latter. On the one hand, he laments this condition because, as he 
writes, “...collective memory... is drastically selective. Certain memories live 
on; the rest are winnowed out, repressed, or simply discarded by a process of 
natural selection which the historian, uninvited, disturbs and reverses." On the 
other hand, he critiques history for its sterile posture of distance from meaning 
and relevance: “...Jewish historiography can never substitute for Jewish mem- 
ory.... A historiography that does not aspire to be memorable is in peril of be- 
coming a rampant growth" (Yerushalmi 1982, p. 101). 

Recent approaches within historiography, however, have critiqued this un- 
derstanding of the relations between history and memory. First, as historiogra- 
phy has broadened its focus from the official to the social and cultural, memory 
has become central “evidence.” Theorists now recognize, moreover, that 
memory frequently employs history in its service: Professional historians have 
often provided political legitimation for nationalism and other more recon- 
structive identity struggles. This involvement calls into question not only the 
success of historians in being objective, but the very notion of objectivity itself 
(Novick 1988). Furthermore, postmodernists have challenged the "truth- 
claim" of professional historiography by questioning the distinction between 
knowledge and interpretation, and derivatively between history and memory 
(White 1973, Veyne 1984). Philosophers have argued forcefully that histori- 
ography constructs as much as uncovers the “truths” it pursues (Novick 1988, 
Iggers 1997). History is written by people in the present for particular pur- 
poses, and the selection and interpretation of “sources” are always arbitrary. If 
“experience,” moreover, is always embedded in and occurs through narrative 
frames, then there is no primal, unmediated experience that can be recovered. 
The distinction between history and memory in such accounts is a matter of 
disciplinary power rather than of epistemological privilege. Burke (1989) 
therefore refers to history as social memory, using the term as “a convenient 
piece of shorthand which sums up the rather complex process of selection and 
interpretation." Hutton (1993) titles his book History as an Art of Memory. 
Schwartz argues that “Sharp opposition between history and collective mem- 
ory has been our Achilles Heel, causing us to assert unwillingly, and often de- 
spite ourselves, that what is not historical must be 'invented' or 'con- 
structed', which transforms collective memory study into a kind of cynical 
muckraking" (B Schwartz, personal communication). 

Before turning to the history of memory and to the substantive results of so- 
cial memory studies, it is possible, on the basis of the preceding reconstruction, 
to define some of the basic concepts for such an inquiry. Halbwachs distin- 
guished among autobiographical memory, historical memory, history, and col- 
lective memory. Autobiographical memory is memory of those events that we 
ourselves experience, while historical memory is memory that reaches us only 
through historical records. History is the remembered past to which we no 
longer have an “organic” relation—the past that is no longer an important part 
of our lives—while collective memory is the active past that forms our identi- 
ties. Memory inevitably gives way to history as we lose touch with our pasts. 
Historical memory, however, can be either organic or dead: We can celebrate 
even what we did not directly experience, keeping the given past alive for us, 
or it can be alive only in historical records, so-called graveyards of knowledge. 

Though collective memory does seem to take on a life of its own, Halb- 
wachs reminds us that it is only individuals who remember, even if they do much 
of this remembering together. And Coser (1992) points out that, while Durk- 
heim writes “Society” with a capital S, Halbwachs employs the more cautious 
“groups.” Halbwachs characterized collective memory as plural; he shows that 
shared memories can be effective markers of social differentiation (Wood 
1994, p. 126). Some authors, nonetheless, detect the collectivist overtones of 
the Durkheimian tradition in Halbwachs' work. Fentress & Wickham (1992) 
worry about “a concept of collective consciousness curiously disconnected 
from the actual thought processes of any particular person,” which risks ren- 
dering “the individual a sort of automaton, passively obeying the interiorized 
collective will.” 

As a result of these problems, some authors prefer other terms to “collective 
memory." Sturken (1997) defines "cultural memory" as “memory that is 
shared outside the avenues of formal historical discourse yet is entangled with 
cultural products and imbued with cultural meaning." Fentress & Wickham 
(1992) refer to “social memory” rather than to collective memory. Olick & 
Levy (1997) refer to “images of the past" as parts of “political cultural pro- 
files." Assmann (1992) distinguishes among four modes of memory in an ef- 
fort to capture the range of memory problematics: 1. mimetic memory—the 
transmission of practical knowledge from the past; 2. material memory—the 
history contained in objects; 3. communicative memory—the residues of the 
past in language and communication, including the very ability to communi- 
cate in language; and 4. cultural memory—the transmission of meanings from 
the past, that is, explicit historical reference and consciousness. 

Critics who charge that "collective memory" over-totalizes prefer a prolif- 
eration of more specific terms to capture the ongoing contest over images of 
the past: official memory, vernacular memory, public memory, popular mem- 
ory, local memory, family memory, historical memory, cultural memory, etc. 
Still others argue that a collective memory concept has nothing to add to older 
formulations like myth, tradition, custom, and historical consciousness. Gedi 
& Elam (1996) hold that overuse of the term collective memory is “an act of in- 
trusion... forcing itself like a molten rock into an earlier formation... unavoid- 
ably obliterating fine distinctions...." If defined too broadly, as the pattern- 
maintenance function of society or as social reproduction per se, what is not 
social memory? On the other hand, Burke (1989) argues that "if we refuse to 
use such terms, we are in danger of failing to notice the different ways in which 
the ideas of individuals are influenced by the groups to which they belong." 
Schwartz uses Herbert Blumer's classical distinction between operational and 
sensitizing concepts, and classifies collective memory as of the latter sort. He 
argues that collective memory “is not an alternative to history (or historical 
memory) but is rather shaped by it as well as by commemorative symbolism 
and ritual. To conceive collective memory in this way sensitizes us to reality 
while encouraging us to recognize the many things we can do to reality inter- 
pretively" (personal communication). 

In this review, we refer to “social memory studies" as a general rubric for 
inquiry into the varieties of forms through which we are shaped by the past, 
conscious and unconscious, public and private, material and communicative, 
consensual and challenged. We refer to distinct sets of mnemonic practices in 
various social sites, rather than to collective memory as a thing. This approach, 
we argue, enables us to identify ways in which past and present are intertwined 
without reifying a mystical group mind and without including absolutely every- 
thing in the enterprise. Methodologically, Olick (n.d.) and Schudson (1992) 
suggest specifying the different institutional fields that produce memory such 
as politics and the arts; Olick (n.d.) and Reichel (1995) theorize the varying 
links between media and memory; Wagner-Pacifici (1996) places special em- 
phasis on memory's cultural forms. 


The History of Memory 


Instead of trying to fix conceptual distinctions theoretically, many scholars 
have called for a historical approach to social memory, one that sees such dis- 
tinctions as emerging in particular times and locations and for particular pur- 
poses. As Matsuda (1996, p. 16) puts it, “... memory has too often become an- 
other analytical category to impose on the past; the point should be to re- 
historicize memory and see how it is so inextricably part of the past.” Yates’ 
(1966) The Art of Memory is the seminal work in this vein, charting the links 
between memory systems and particular historical orders. Yates traces trans- 
formations in ars memoria—the rhetorical art of memorizing through spatial 
images—from Roman times through the Renaissance, where the art of mem- 
ory persisted in the humanist tradition despite its decline due to the spread of 
the printing press. Coleman (1992) as well offers a comprehensive history of 
theories of memory from antiquity through later medieval times, noting the 
particular sophistication of medieval theories, which address the reconstruc- 
tion of narratives. Following Yates’ lead, Carruthers (1990) demonstrates the 
persistence of memory training even with the spread of texts, which resulted in 
the highly mixed oral-literate nature of medieval cultures. Indeed, the dissemi- 
nation of written materials, she argues, occurred through memorization and 
oral transmission. 

Yet for Yates, Coleman, and Carruthers, memorization remains central: In 
earlier centuries, this form of remembering was of greater significance than it
is today. But even for those periods, an analysis of ars memoria reveals little 
about popular memory due to its elite focus. In response, Geary (1994, p. 8) fo- 
cuses on ordinary medieval people who, he argues, were actively engaged in 
creating their past: "Individuals and communities copied, abridged, and re- 
vised archival records, liturgical texts, literary documents, doing so with refer- 
ence to physical reminders from previous generations and a fluid oral tradi- 
tion....” Geary (1994) also breaks with Yates et al by expanding the definition 
of memory to include textual transmission as well as oral memorization. 

But while ancient arts of memory do persist in the interstices of later mne- 
monic forms (Matsuda 1996, Casey 1987, Zonabend 1984), it is virtually im- 
possible to discuss collective memory without highlighting historical develop- 
ments in the material means of memory transmission. While new technologi- 
cal means of recording the past are often seen as "artificial," with time they are 
incorporated into the accepted cultural construct of memory. By extension, 
contemporary interest in the social bases of memory may be traced at least 
partly to a historical shift of memory from the mind to external loci; without 
externalization of memory in "artificial" sites, the social location of memory is 
not as clear. Even in earlier cultures, however, direct attention to material 
forms of memory can yield important insights. 

Assmann (1992), for instance, argues that while Babylonian, Egyptian, 
Greek, and Jewish cultures all developed the technical means for preserving 
the past (word, text, writing, and book), only the Greek and Jewish persisted as 
living traditions, due to the peculiarities of their historical experiences. In the 
Jewish case, where the entire weight of cultural continuity rested on
fundamental texts, everything depended on keeping them alive. This led to the
development of a new form of reading (commentary) and a new kind of
historical consciousness. This study thus contradicts technologically determinist
claims about the importance of the alphabet for cultural continuity: While the 
development of an alphabet was important, it was not sufficient, nor were its 
effects uniform. 

Epochal generalizations about the developing relations between memory and 
technologies of communication have nonetheless described a broad shift from 
orality to literacy over millennia. Founding this tradition, McLuhan theorized 
the effects of electronic communications on typographic culture within a history 
that includes the move from manuscript to print culture two centuries earlier and 
from orality to literacy a millennium before that (Hutton 1993). Subsequently, 
Ong traced a long-range pattern from orality to manuscript literacy, to print cul- 
ture, to media culture, drawing out implications for memory. The invention of 
writing in antiquity was the seed for the rise of more abstract thinking. Because 
that capability resided in the hands of a small elite, however, it was not until the
vast expansion of literacy in the seventeenth and eighteenth centuries that the 
profound possibilities of written culture became a dominant cultural form. In 
the process, memory became a public affair, and a problematic one at that. 

On the basis of this kind of macro-historical theory, many contemporary 
scholars of memory work with an image of oral culture as richly expressive 
and of literate culture as detached and introspective (Goody 1986). “Memory,” 
as Hutton (1993, p. 16) puts it, “first conceived as a repetition, is eventually
reconceived as a recollection." Where Proust revelled in the "involuntary"
memory evoked by the taste of a tea cookie, macro-historical theorists of memory
describe modern memory as predominantly “voluntary” or active. Hobsbawm 
& Ranger (1983), for instance, distinguish sharply between custom and tradi- 
tion. The former is the unproblematic sense of continuity that undergirds the 
gradual, living changes of "traditional" societies. Tradition, in contrast, aims 
at invariance and is the product of explicit ideologies.

In an important synthesis, Le Goff (1992) follows Leroi-Gourhan (1993 
[1964-1965]) in identifying five distinct periods in the history of memory.


* First, peoples without writing possessed what Le Goff calls “ethnic 
memory," in which memory practices are not highly developed arts; Le 
Goff therefore sees societies without writing as free, creative, and vital.


* Second, the move from prehistory to Antiquity involved the develop- 
ment from orality to writing, though writing never fully supplanted oral 
transmission. This new condition enabled two important new mnemonic 
practices—commemoration and documentary recording—associated 
with emerging city structures. 


* Third, memory in the Middle Ages involved "the Christianization of 
memory and of mnemotechnology, the division of collective memory
between a circular liturgical memory and a lay memory little influenced by
chronology, the development of the memory of the dead and especially 
of dead saints...” (p. 68). 


* Fourth, memory as it developed from the Renaissance to the present in- 
volved the gradual revolution in memory brought about by the printing 
press, which required the long development of a middle class readership 
to complete its effect. With a "progressive exteriorization of individual 
memory," the collective memory grew to such a degree that the individ- 
ual could no longer assimilate it in toto. In the nineteenth century, Ro- 
manticism added to a growing fervor for commemorating, and prolifer- 
ated multifarious forms for doing so, including coins, medals, postage 
stamps, statuary, inscriptions, and souvenirs. In the same period, we wit- 
ness the birth of archives, libraries, and museums, reflecting the interests 
of different nations seeking to build shared identities within their citizen- 
ries. 


* Finally, changes in the twentieth century constituted another genuine
revolution in memory, the most important element of which was the in- 
vention of electronic means of recording and transmitting information, 
which not only change the way we remember, but provide new ways of 
conceptualizing memory. Not only computers but image processing and 
the immune system (Sturken 1997) now serve as basic models and
metaphors for thinking about memory.


A key point in many histories of memory is that a significant transformation
in the experience of time occurred at some debatable point between the Middle 
Ages and the nineteenth century. Many authors describe an existential crisis 
arising out of the increased possibility for abstract thought discussed above, 
out of accelerating change resulting from increased industrialization and ur- 
banization, as well as out of the resultant decline of religious worldviews and 
of traditional forms of political authority. Koselleck (1985), for instance, de- 
scribes a shift from a “space of experience” to a “horizon of expectation.” 
Through the seventeenth and eighteenth centuries, a wide variety of new expe- 
riences and events produced an awareness of the “noncontemporaneity of the 
contemporaneous,” which led, in turn, to a sense of a human future and of the 
distinctness of history. Ariès’s (1974) work on attitudes toward death and
dying in Western culture, as well, attributes the rising importance and frequency
of commemorative practices in the nineteenth century to an increased sense of 
change: The past was no longer felt to be immediately present but was some- 
thing that required preservation and recovery. 

Hobsbawm (1972) describes the rise of linear historical consciousness as a 
necessary solution to the existential problems of rapid transformation:
“Paradoxically, the past remains the most useful analytical tool for coping with
constant change." Thompson (1995) attributes a similar dynamic largely to
transformations in media technology, which extended individuals' experiences
beyond the sphere of day-to-day encounters: "The process of self-formation
[thus] became more reflexive and open-ended." Jacoby (1975) and Berman 
(1982), among others, attribute to late modernity a condition—at least partly 
related to rampant commodification—that makes it harder and harder to relate 
to the past, producing what Jacoby calls "social amnesia." When the past is no 
longer obviously connected to the present, memory becomes of diagnostic im- 
portance, as Terdiman (1993) puts it. Yerushalmi (1982) specifies much of this 
in his discussion of Jewish memory: "The modern effort to reconstruct the 
Jewish past begins at a time that witnesses a sharp break in the continuity of 
Jewish living and hence also an ever-growing decay of Jewish group
memory.” In sum, according to Schieder (1978, p. 8), “...historical thought served a
compensating function making up for the actual loss of history by exaggerat- 
ing a consciousness of it." 

The connection between nationalism and social memory appears to have 
been especially important. Cressy (1989) traces a new kind of memory in Eng- 
land to the seventeenth century, a memory that gave expression to a mythic and 
patriotic sense of national identity: “The calendar became an important
instrument for declaring and disseminating a distinctively Protestant national
culture... binding the nation to the ruling dynasty and securing it through an
inspiring providential interpretation of English history" (Cressy 1989, p. xi).
Calendars map the basic temporal structures of societies, enabling and con- 
straining their abilities to remember different pasts (Zerubavel 1981); many 
have noted how a new calendar served French Republican leaders as effective 
symbolic markers for their break from the old regime (Hunt 1984, Ferguson 
1994). More generally, Gillis (1994) links the construction of national memo- 
ries to what he calls a cult of new beginnings. 

Anderson (1991) combines insights into the spread of print literacy, capital- 
ist commerce, and the decline of religious worldviews to explain the rise of 
historicizing national identities as a pervasive modern principle. In his ac- 
count, the transformation of temporality and the associated rise of interest in 
the past made it possible “to think the nation." Print capitalism, according to 
Anderson, was the principal agent of this transformation toward what
Benjamin (1968) called the "empty, homogeneous time" of the nation-state. Felt
communities of fate were secured across wide territories by newspapers and 
novels, which produced shared culture among people who would never meet. 
As a result, in Smith's (1986) words, “ethnic nationalism has become a 'surro- 
gate’ religion which aims to overcome the sense of futility engendered by the 
removal of any vision of an existence after death, by linking individuals to
persisting communities whose generations form indissoluble links in a chain of
memories and identities." 

Others have given similar insights a more critical turn. Boyarin (1994), for 
instance, points out that statist ideologies "involve a particularly potent ma- 
nipulation of dimensionalities of space and time, invoking rhetorically fixed 
national identities to legitimate their monopoly on administrative control." Re- 
nan is remembered from the nineteenth century for having pointed out the 
ways in which national identities combine remembering and forgetting, with 
greater emphasis on the latter: They forget that they are not inevitable and that 
their internal fissures may be as significant as their external boundaries (An- 
derson 1991). Duara (1995) writes that the relationship between linear his- 
toricity and the nation-state is repressive: “National history secures for the 
contested and contingent nation the false unity of a self-same, national subject 
evolving through time...” enabling “conquests of Historical awareness over 
other, ‘nonprogressive’ modes of time.” 

Many writers have pointed to the ways in which national states consciously 
manipulate and exploit professional history. Smith (1986) writes that “One 
sign of the formation of the nation out of the protonation is the shifting of the 
center of collective memory from the temple and its priesthood to the
university and its scholarly community.” Breisach (1994) shows that “Historians
were called on to mediate between the demands for change and the equally 
strong desire to see the continuity of past, present, and future preserved.... Pre- 
sented by careful scholars with great eloquence, these histories became popu- 
lar possessions rather than scholarly curiosa.” Novick (1988) shows how, de- 
spite protestations of disinterest and objectivity, American historical scholar- 
ship has always been inextricably tied to contemporary political problematics. 
More generally, Lévi-Strauss (1979) argues that “In our own societies, history 
has replaced mythology and fulfills the same function....” Nevertheless, Noir- 
iel (1996) has argued that “the degree to which commemoration of historical 
origins is essential for building political consensus may be treated as a vari- 
able.” Smith (1986) as well warns against either overgeneralizing or over- 
specifying the urge toward historical commemoration: Nostalgia exists in 
every society; in the era of the nation-state nostalgia for the “ethnic past” has 
merely become more acute. 

In a major contribution, Hobsbawm (1983) notes the proliferation in the 
mid to late nineteenth century of state-led efforts to “invent” useful traditions 
to shore up their fading legitimacy. Particularly after 1870, in conjunction with 
the emergence of mass politics, political leaders “rediscovered the importance 
of ‘irrational’ elements in the maintenance of the social fabric and the social 
order.” Many thinkers thus advocated the construction of a new “civil
religion;” successful leaders sought to imbue educational institutions with
nationalist content, to expand public ceremony, and to mass produce public
monuments. This impulse spread to nonstate groups as well, producing an interest in
genealogies of all sorts, including social registers for the upper classes. With
more emphasis on local cultures in the nation-building process, Confino 
(1997) shows how German nation-building in the nineteenth century (and by 
extension other nation-building projects elsewhere) required assimilating di- 
verse regional memories into one coherent national identity, which was suc- 
cessful only when the national was mediated through local categories. 

Not all thinkers of the nineteenth century, however, championed this prolif- 
eration of history. Nietzsche (1983) was highly critical of his age’s pervasive 
production of the past in both its scientific and monumental guises. While
recognizing that it is the power to bring the past to life that constitutes the
humanity of human beings, Nietzsche also claims that an excess of history can
destroy our humanity: “The past," he writes, “has to be forgotten if it is not to
become the gravedigger of the present." Many contemporary writers on social
memory quote Borges's short story about “Funes the Memorious,” depicting
the agony of a young man who has lost the ability to forget. Nietzsche sees
historicism's scientific attitude as producing “dead” knowledge, while
monumental history “inspires the courageous to foolhardiness and the inspired to
fanaticism." In another well-known essay, Butterfield (1965 [1931]) warns against
an overly interested approach to history writing, what he calls “Whig history,” 
which produces “a story which is the ratification if not the glorification of the 
present." 

Moving to a slightly later period, historians of memory emphasize the im- 
portance of the First World War for perceptions of temporality and the status 
of national memory. Benjamin in particular portrayed the War experience as a 
decisive moment in a longer-term trend, typified by a decline of storytelling, a 
process which he sees, however, as “only a concomitant symptom of the
secular productive forces of history." The conditions for storytelling, "woven
thousands of years ago in the ambience of the oldest forms of craftsmanship" have
lost their most basic support “because there is no more weaving and spinning 
to go on while... [stories] are being listened to.” “Boredom,” Benjamin (1968) 
writes, “is the dream bird that hatches the egg of experience. A rustling in the 
leaves drives him away.... With this, the gift for listening is lost and the com- 
munity of listeners disappears." For Benjamin, the First World War brought 
this process into a new phase: “...never has experience been contradicted more 
thoroughly than strategic experience by tactical warfare, economic experience 
by inflation, bodily experience by mechanical warfare, moral experience by 
those in power." This cataclysm left people not only without the conditions for 
telling stories but without communicable experiences to tell. 

With less apocalyptic vision, other writers as well have noted a change in 
the form of memory after the War. Mosse (1990), in a study of "The Myth of 
the War Experience," notes that the burial of the dead and commemoration
became the tasks of specially formed national commissions during the War.
Paradoxically, just as the effect of war was felt more brutally than ever among 
civilian populations, the tasks of consolation were made more public than ever 
before. As a result, “The memory of the war was refashioned into a sacred ex- 
perience which provided the nation with a new depth of religious feeling, put- 
ting at its disposal ever-present saints and martyrs, places of worship, and a 
heritage to emulate." Additionally, Winter (1995) explores the new forms of 
war memorial that emerged to appropriate the devastation of total war for na- 
tional purposes, though he emphasizes the proliferation of more introspective 
forms too. Gillis (1994) notes that World War I marked a massive democrati- 
zation of the cult of the dead. In a detailed study of war literature that emerged 
in Great Britain, Fussell (1975) characterizes this corpus as comprising a
peculiarly “modern” form of memory.

While the First World War thus created new attitudes toward both the pres- 
ent and the past, the Holocaust is said to have produced an even more decisive 
crisis of representation. “We are dealing," writes Friedlander (1992, p. 3), 
“with an event which tests our traditional conceptual and representational
categories, an ‘event at the limits.’" There is the oft-quoted remark of Adorno that
to write lyric poetry after Auschwitz is barbaric. By extension, many have
portrayed the Holocaust as challenging the validity of any totalizing view of
history (Friedlander 1992, p. 5). In German intellectual circles, this issue has
spawned an ongoing debate between those who maintain that the Holocaust 
was unique and those who call for "historicizing" it. The literature on German 
debates about the Nazi past is too voluminous to even begin to report. Good 
starting places are Maier (1988), Evans (1989), and Olick (1993). 

Gillis (1994), Mosse (1990), Young (1993), and Koonz (1994) document 
changes in war memorials after the Second World War, noting that the mem- 
ory of war is now understood in a new way. Where nationalist leaders ex- 
ploited a cult of the war dead after the First World War to foment further na- 
tionalist sentiment, memory after Auschwitz and Hiroshima has often been 
more problematic (Bosworth 1993). Where earlier monuments aimed to exac- 
erbate resentment for future campaigns, many later monuments worked to 
erase a clear burden. Indeed, Young (1992) goes so far as to implicate the very 
form of monumentalization in the forgetting: “...once we assign monumental 
form to memory, we have to some degree divested ourselves of the obligation 
to remember." Adorno (1967) had many years earlier pointed out the associa- 
tion between the words “museum” and “mausoleum.” 

While some authors make the Holocaust the decisive turning point, others 
see in it merely one last and most horrible stage in a development already
under way (one which included recognition of horrors of colonialism, two
world wars, racism, environmental damage, etc) on the road to
postmodernity. In either case, from early intimations of postmodernism in Heidegger,
through the critical theory of Benjamin and Adorno, to the postmodernist
theory of the 1980s, the connected problems of time and memory have been
central issues for cultural criticism. Postmodern writers have addressed the
ruptured sense of continuity and the multiple temporalities that they see as
characterizing our highly mediated society. While many of these theorists have
offered important insights, we focus very briefly on only two here, Huyssen and Nora.
For a critique of the postmodernist account of memory as overly unilinear in its
critique of unilinearity, see Schwartz (1997). 

In Twilight Memories, Andreas Huyssen (1995) characterizes the situation 
of memory in postmodernity as paradoxical. He notes the simultaneous popu- 
larity of museums and the resurgence of the monument and the memorial at the 
same time there is an “undisputed waning of history and historical
consciousness." Novelty, he says, is now associated with new versions of the past rather
than with visions of the future. This memory boom, however, is not to be con- 
fused with the historical fever to legitimatize nation-states that Nietzsche de- 
rided. "In comparison, the mnemonic convulsions of our culture seem chaotic, 
fragmentary, and free-floating." 

His pessimism, however, is not complete, and his analysis is perceptive: 
“The current obsession with memory," Huyssen writes, "is not simply a
function of the fin de siècle syndrome, another symptom of postmodern pastiche.
Instead, it is a sign of the crisis of that structure of temporality that marked the 
age of modernity with its celebration of the new as utopian, as radically and ir- 
reducibly other." Where Benjamin and Adorno ascribed the contemporary cri- 
sis of memory to the forgetting at the center of the commodity, Huyssen relates 
the further development of media technologies since their time to “the evident 
crisis of the ideology of progress and modernization and to the fading of a 
whole tradition of teleological philosophies of history." As a result, the post- 
modern condition of memory is not wholly one of loss: “Thus the shift from 
history to memory represents a welcome critique of compromised teleological 
notions of history rather than being simply anti-historical, relativistic, or sub- 
jective." The contemporary crisis of memory, Huyssen argues, "represents the 
attempt to slow down information processing, to resist the dissolution of time 
in the synchronicity of the archive, to recover a mode of contemplation outside 
the universe of simulation and fast-speed information and cable networks, to 
claim some anchoring space in a world of puzzling and often threatening het- 
erogeneity, non-synchronicity, and information overload." Where postmodern 
antiepistemology derides any easy correspondence between experience and 
memory, Huyssen characterizes that fissure as “a powerful stimulant for cul- 
tural and artistic creativity." 

French historian Pierre Nora (1992), leading theoretician and editor of a 
massive seven-volume project on “places” or “lieux” of French memory, also 
begins by observing the paradoxes of memory in postmodernity. “We speak so
much of memory," he writes, "because there is so little of it left." Nora can in
this way be seen as the true heir to Halbwachs, who noted the passing of mem- 
ory into history as we lose a living relation to the past, though Nora sees this 
process as even more dramatic and irreversible, and as more clearly political, 
than Halbwachs did. Where premodern societies live within the continuous 
past, contemporary societies have separated memory from the continuity of 
social reproduction; memory is now a matter of explicit signs, not of implicit 
meanings. We now compartmentalize memory as a mode of experience; our 
only recourse is to represent and invent what we can no longer spontaneously 
experience (Wood 1994). Nora thus contrasts contemporary "lieux" or places 
of memory to earlier lived “milieux.” The former are impoverished versions of 
the latter: “If we were able to live within memory, we would not have needed 
to consecrate lieux de mémoire in its name." 

Nora's project is to catalogue all of these places of memory in French soci- 
ety. He organizes the analyses around three principles which he sees as layered 
on top of one another in telling ways: the Republic, the Nation, and "Les Fran- 
ces." For Nora, this ordering represents a historical progression from unity, 
through uncertainty, to multiplicity. The peculiar status of the second, the 
memory-nation, is the linchpin. In its ascendancy, the memory-nation relied 
on national historical narratives to provide continuity through identity. In the 
nineteenth century, change was still slow enough that states could control it 
through historiography. But, Nora argues, the nation as a foundation of iden- 
tity has eroded as the state has ceded power to society. The nation itself, earlier 
shored up by memory, now appears as a mere memory trace. In contrast to 
theories of the nation discussed above, Nora thus sees the nation-state as de- 
clining in salience, the last incarnation of the unification of memory and his- 
tory, a form in which history could provide the social cohesion memory no 
longer could. History too has now lost its temporary ability to transmit values 
with pedagogical authority (Wood 1994). All that is left, as Hutton (1993) 
characterizes Nora’s project, is to autopsy the past, at best to celebrate its cele- 
brations. 

Many writers, however, note that older styles of memory persist in the inter- 
stices of modern historical consciousness, and they see in this coexistence an 
indictment of any clear dichotomy between memory and history (Zonabend 1984),
while others worry that such accounts are inappropriately teleological. Rappa- 
port (1990), moreover, charges that the dichotomy between oral and written 
modes of memory serves a colonialist mentality that devalues non-Western 
forms of remembering. These critiques notwithstanding, it is clear that the 
situation of memory has changed rather dramatically both over the centuries 
and especially in the last few decades. Nora's approach raises as many ques- 
tions as it answers: Given the scope of the cataloguing project, what is not a 
lieu de mémoire? Isn't the attempt to catalogue even what one recognizes as
impoverished memory traces itself a political act of recuperation (Englund
1992)? Nonetheless, Nora's theory remains the most comprehensive empirical 
effort to confront the contemporary situation of memory. Where Yates
suggests a history of memory, Nora takes it to a programmatic level.


Processes of Social Memory: Statics and Dynamics 


The history of memory outlined above makes clear that memory is not an un- 
changing vessel for carrying the past into the present; memory is a process, not 
a thing, and it works differently at different points in time (Zelizer 1995). Soci- 
ologists of memory have thus sought to specify at a more middle level how 
memory processes operate within specific social institutions. Here the quintes- 
sential sociological issues of power, stratification, and contestation are central. 
One merit of Nora's project is that it reminds us of all the different places
historical imagery and practices occur. Sociologists have long studied many of
these sites and practices in an attempt to understand the statics and dynamics of 
social reproduction. Key terms here include identity, contestation, malleabil- 
ity, and persistence. 


Identity 


Erikson (1959) is usually credited with introducing the identity concept to
describe psychological development over the life course: personal identity,
despite periodic crises, is self-sameness over time. A recent narrative turn in
identity theory, however, has warned against essentializing identities; instead, 
they are seen as ongoing processes of construction in narrative form (Bruner 
1990, Calhoun 1994). As MacIntyre (1984, p. 218) puts it, “...all attempts to 
elucidate the notion of personal identity [and, by extension, group identity] in- 
dependently of and in isolation from the notions of narrative... are bound to 
fail." As Hall writes, "Identities [personal or collective] are the names we give 
to the different ways we are positioned by, and position ourselves in, the narra- 
tives of the past" (Huyssen 1995, p. 1). Identities are projects and practices, not 
properties. 

Many recent social theorists have extended the concept to the social level, 
noting, as MacIntyre does, that "The possession of an historical identity and 
the possession of a social identity coincide.” As Hobsbawm (1972) writes, “To 
be a member of any human community is to situate oneself with regard to one's 
(its) past, if only by rejecting it." In a much-quoted formulation, Bellah and co- 
authors (1985, p. 153) write that “Communities... have a history—in an im- 
portant sense are constituted by their past—and for this reason we can speak of 
a real community as a ‘community of memory,’ one that does not forget its 
past. In order not to forget that past, a community is involved in retelling its 
story, its constitutive narrative." "The temporal dimension of pastness,"
Wallerstein (1991, p. 78) adds, “is central and inherent in the concept of
peoplehood."

A crucial link between the literatures on identity and memory concerns how 
we acquire our personal and social identities. Halbwachs paid particular atten- 
tion to the role of the family in shaping how we construct the past; Zerubavel 
(1996) generalizes this insight by discussing what he calls “mnemonic sociali- 
zation” into “mnemonic communities.” “All subsequent interpretations of our 
early ‘recollections,’” he writes, “are only reinterpretations of the way they 
were originally experienced and remembered within the context of our fam- 
ily.” Much of what we “remember,” moreover, we did not experience as indi- 
viduals. "Indeed," Zerubavel writes, "being social presupposes the ability to 
experience events that happened to groups and communities to which we be- 
long long before we joined them as if they were part of our own past...." This 
“sociobiographical memory" is the mechanism through which we feel pride, 
pain, or shame with regard to events that happened to our groups before we 
joined them. 

Another central conceptual tool for analyzing this intersection between in- 
dividual and collective identities as constituted through shared memories is 
that of generations. Mannheim's (1952 [1928]) seminal work here argues that 
social and political events shape generations through major shared experience 
during their formative years. It is not an accident that the notion of generations 
flowered in Europe after World War I. The war created a felt community of ex- 
perience especially among the soldiers. Wohl (1979) refers to “the generation 
of 1914," whose members, following Mannheim’s theory, were in the right 
place (total war) at the right time (when they were young men) to form a par- 
ticularly clearly demarked generation. Schuman & Scott (1989) develop and 
test Mannheim's theories about the connections between generations and so- 
cial memory by asking different age cohorts to rank various historical events in 
terms of their perceived importance. Striking response differences, they argue, 
demonstrate that generational differences in memory are strong, that adoles- 
cence and early adulthood are indeed the primary periods for "generational im- 
printing in the sense of political memories," and that later memories can best 
be understood in terms of earlier experiences. Shils (1981) points out that new 
generations define themselves against their elders and thus bear a different re- 
lation to the past than previous generations. Theorists of nationalism have 
pointed out (Smith 1986, Anderson 1991) that nationalist movements almost 
always centrally involve youth movements. 

In the previous section, we saw that the nation-state, despite internal divi- 
sions along generational, regional, religious, and other lines, has often claimed 
to be the primary form of organizing social identity. But in the history of mem- 
ory, this remains a broad epochal generalization. Sociologists have studied at a 
closer level how this aim to dominate identity manifests itself through
collective mnemonic processes. Collective memory does not merely reflect past
experiences (accurately or not); it has an orientational function (Schwartz
1996a). As Schwartz puts it, "collective memory is both a mirror and a lamp— 
a model of and a model for society" (personal communication).

National and other identities are established and maintained through a vari- 
ety of mnemonic sites, practices, and forms. Spillman (1997), for instance, 
compares the role of centennial and bicentennial celebrations in Australia and 
the United States, demonstrating the different ways each of these countries 
used commemorations to address diverse issues. Hunt (1984) explores cloth- 
ing, medals, language, and other symbolic forms as well, as markers of mem- 
ory and identity. Cerulo (1995) examines national anthems, though she does 
not make the connection to social memory explicit. Schwartz (1990, 1991) and 
Goode (1978), not to mention the classical work of Thomas Carlyle (1901),
examine the role of heroes in national identity. Coontz (1992) documents
nostalgia for earlier "golden ages," as does Smith (1986), who notes the importance
of origin myths in creating and maintaining identities. Zerubavel (1995) dis- 
cusses national mythologies, and associated physical places, as ordering prin- 
ciples for articulations of national memory. Ferguson (1994), Boyer (1994), 
and Haydon (1995) examine the ways urban form embodies a vision of identity 
by inscribing the past. Muensterberger (1994) explores collecting as a mne- 
monic practice, while Bennett (1995) undertakes a history of the museum and 
of the world's fair as sites for articulating national identities. Rochberg-Halton 
(1986) studies the role of household objects in establishing the relations be- 
tween memory and identity. Olick (1993, 1997) examines political speech as 
mnemonic practice. Dayan & Katz (1992) see the mass media as producing 
electronic monuments that compete with history writing to frame social mem- 
ory; Lang & Lang (1989) examine the role of the news in forging collective 
memory. The literature on film and national memory is enormous. 

There are many important case studies of the connections between memory
and particular national identities, emphasizing both positive and negative as- 
pects of those historical formations. Rousso (1991) and Maier (1988) study 
how France and Germany respectively confront their difficult legacies of 
World War II. Roniger (1997) and Nino (1996) look at how various countries 
in the Southern Cone, including Chile, Uruguay, and Argentina, confront the 
memory of human rights violations. Aguilar (1997) discusses the problematic 
legacy of the Spanish Civil War for subsequent regimes. Trouillot (1995) ana- 
lyzes the complex memories of colonialism and contemporary struggles over 
historical identity in Haiti. Sturkin (1997) examines memories of Vietnam and 
of the AIDS epidemic in the United States as sites for working out national 
identity. Buruma (1994) compares Japanese and German memories of the Sec- 
ond World War. Gluck (1993) examines different epochs in Japanese memory. 
A special issue of the journal Representations edited by Greenblatt, Rev, and
Starn (1995) studies struggles in Eastern Europe with memory of pre-1989
events; a volume edited by Watson (1994) examines memory under state
socialism; Tumarkin (1994) has written on the cult of World War II in Russia.
Borneman (1997) analyzes how various central and eastern European coun- 
tries have settled accounts after 1989 with their Communist pasts, as does Ro- 
senberg (1995). 

Herzfeld (1991) uses ethnography to analyze the complex negotiations be- 
tween local and national memories in a Greek town, as does Confino (1997) 
for Wilhelmine Germany. Kammen (1991, 1978) is the preeminent analyst of 
American memory, documenting the changing forms of historical
consciousness in American history; Thelen (1989) has edited an important volume on
American memory. Mudimbe & Jewsiewicki (1993) explore history making in 
Africa, while Fabre & O'Meally (1994) explore the role of memory in African- 
American identity. Segev (1993) and Zerubavel (1995) present rich studies of 
memory in Israeli collective identity. Darian-Smith & Hamilton (1994) have
edited a volume on Australian memory. A massive tome sponsored by the 
Holocaust Memorial Center (Wyman 1996) contains monographs on how 24 
different nations reacted to the Holocaust, exploring how that event shaped na- 
tional identities and vice versa. Rapaport (1997) and Irwin-Zarecka (1989) re- 
search how Jews in contemporary Germany and Poland, respectively, live in 
the lands of their former oppressors. 

One particularly vibrant area of debate concerning the connection between 
memory and identity has been scholarship concerning heritage. The classic 
work in this field is Lowenthal's (1985) monumental The Past Is a Foreign 
Country, which documents the ways in which national pasts, particularly their 
built and geographical remains, are reshaped according to present interests. 
Heritage sites appear to be especially useful for dramatizing the historicity of 
the nation, particularly in Great Britain. Indeed, the heritage debate has been 
most heated in Britain, which possesses an elaborate physical legacy and
which has a substantial history of propagating it. Barthel (1996) compares 
such debates in Great Britain and the United States, finding more democratic 
and inclusive versions in the United States and more elitist programs in the 
United Kingdom; Koshar (1994) studies such processes in West Germany. 
Wright (1985) has provided a detailed account of British debates and, along 
with Hewison (1987), criticizes the nostalgia “industry” for producing mind- 
less, pacifying, and politically conservative commodifications of the national 
past. Samuel (1994), on the other hand, sees a redemptive potential in the heri- 
tage industry; to argue otherwise is to denigrate popular consciousness in the 
name of the people. Many others have documented the commercialization of 
nostalgia, particularly in the form of reconstructed villages, Disneyland
versions of the American past, and souvenirs (Davis 1979). For an informative
review of the literature on nostalgia, see Vromen (1993).

National identities, of course, are not the only ones available, but
hegemonic forces within the nation-state have worked hard to appropriate and
silence other identity discourses. As Alonso (1988) explains, “Historical
chronologies solder a multiplicity of personal, local, and regional historicities and
transform them into a unitary, national time." Almost all of the studies just 
mentioned, however, highlight not the simplicity or unity of national narra- 
tives, but the fact that they are essentially contested: Memory sites and mem- 
ory practices are central loci for ongoing struggles over identity. As Sturkin 
(1997) puts it, “Cultural memory is a field of cultural negotiation through 
which different stories vie for a place in history." This sounds almost too be- 
nign and passive; people and groups fight hard for their stories. Contestation is 
clearly at the center of both memory and identity. 


Contestation 


Memory contestation takes place from above and below, from both center and 
periphery. The critical theorists of nationalism discussed above noticed that 
nation-states not only use history for their purposes, but make historiography 
into a nationalist enterprise. Indeed, Wilson et al (1996) document how na- 
tional governments seek to control the very "sources" of professional histori- 
ography by limiting access to state archives. “The hegemony of modern 
nation-states,” Alonso writes (1988), “and the legitimacy which accrues to the 
groups and classes that control their apparatuses, are critically constituted by 
representations of a national past.” This is accomplished through the related 
strategies of naturalization, departicularization, and idealization. This means 
that history as a tool has until recently not been easily available to competing 
identities; as a result, other claimants often have not been very good competi- 
tors. As Foucault (1977) put it, “Since memory is actually a very important 
factor in struggle... if one controls people's memory, one controls their dyna- 
mism." 

In order to resist the disciplinary power of nationalist historiography,
Foucault articulated a notion of "counter-memory," referring to memories that
differ from, and often challenge, dominant discourses. In a similar vein, many
scholars in the past several decades have sought to redirect historical inquiry 
away from the nation-state as a unit of analysis in favor of groups and perspec- 
tives excluded from traditional accounts. Feminist historians, for instance, 
have sought to recover the repressed history of women that has been left out of 
“official” histories. Oral historians (Thompson 1988) see their enterprise as a 
way of giving “history back to the people in their own words.” It claims to be
more democratic than other historiographical methodologies because it pro- 
vides an alternative viewpoint from below, a viewpoint that conventional 
methodology disenfranchises. Feminists and oral historians, in fact, have often
combined their efforts to recover the lost voices of ordinary women's
experience (Leydesdorff et al 1996).

The dominance of national memory over other memories thus not only ex- 
cludes other contestants for control over the national identity but maintains the 
primacy of national over other kinds of identity for primary allegiance. On the 
other hand, counter-memory approaches often employ a rather essentialist no- 
tion of authenticity: Counter-memory is sometimes seen as protected and sepa- 
rate from hegemonic forms. To resist this, the Popular Memory Group (John- 
son et al 1982) and others employing the concept of popular memory (Lipsitz 
1990, Wallace 1996) have sought to understand popular memory in terms of 
ongoing processes of contestation and resistance, a relatively free space of 
reading and reaction in which official and unofficial, public and private, inter- 
penetrate. Dominant memory is not monolithic, nor is popular memory purely 
authentic. Some historians of gender argue that “focusing exclusively on the 
dominated makes a full understanding even of the origins and maintenance of 
their subordination impossible" (Leydesdorff et al 1996). “The intertwining of 
power and memory,” these authors write, “is very subtle... when we as oral 
historians try to rescue and interpret these memories... we also inevitably 
transform their standing and character as memories.” 

Achieving mnemonic consensus is thus rarely easy, charged as it is with 
transcending the infinity of differences that constitute and are constituted by it. 
As Thelen (1989) puts it, “The struggle for possession and interpretation of
memory is rooted in the conflict and interplay among social, political, and
cultural interests...” “It is a product,” Irwin-Zarecka (1994) writes, “of a great
deal of work by large numbers of people." Many empirical studies have fo- 
cused on these struggles, especially over the most public representations of the 
past to be found in monuments and museums. Wagner-Pacifici & Schwartz 
(1991), for instance, introduce a notion of cultural entrepreneurship in their 
study of the struggle for the Vietnam Veterans Memorial. The essays in
Linenthal & Engelhardt (1996) document so-called history wars over a proposed
Smithsonian exhibit on the bombing of Hiroshima. Savage (1994)
characterizes American Civil War memorials as involving "systematic cultural
repression, carried out in the guise of reconciliation and harmony." But as much as
monumental form strives for permanence, Savage argues, “the cultural contest 
that monuments seem to settle need not end once they are built and dedicated": 
Even the most concrete presentations of the past are polysemic. Along these 
lines, Sandage (1993) showed how African-American civil rights groups ap- 
propriated the Lincoln Memorial as a site for articulating their claims. 

Groups can also use images of the past and struggles over history as vehi- 
cles for establishing their power or, perversely, lack of power. Baker (1985) 
demonstrates how revolutionaries in eighteenth-century France used memory 
to achieve their movement aims. Bodnar (1992) shows how various ethnic
groups in the 1920s used national holidays to articulate their versions of
American identity and to claim a unique place in the cultural landscape. 
Takezawa (1995) documents the Japanese-American movement for redressing 
internment during World War II. De Oliver (1996) analyzes the struggle over 
containing alternative voices at the Alamo historic site. There are numerous 
other such studies of contestation and social movements demanding an inver- 
sion of some past or a new monumental interpretation. 


Malleability and Persistence 


Noticing the ways in which images of the past are the products of contestation 
has led varieties of both constructionists and deconstructionists to emphasize 
that the past is produced in the present and is thus malleable. A powerful line of 
so-called “presentism” runs through much of the sociological work on mem- 
ory, work which documents the ways in which images of the past change over 
time, how groups use the past for present purposes, and that the past is a par- 
ticularly useful resource for expressing interests. Within presentism, however, 
it is possible to emphasize either instrumental or meaning dimensions of
memory: The former see memory entrepreneurship as a manipulation of the past for
particular purposes where the latter see selective memory as an inevitable con- 
sequence of the fact that we interpret the world—including the past—on the 
basis of our own experience and within cultural frameworks. Hobsbawm & 
Ranger (1983) are paradigmatic examples of instrumental presentism, while 
Mead (1959 [1932]) and Mannheim (1956) manifest the latter variety; 
Halbwachs (1992) combines elements of both. 

In response to the perceived ascendancy of presentism in social memory 
studies, a number of authors highlight limits on the malleability of the past. 
Schudson (1989, 1992), for instance, argues that “The past is in some respects, 
and under some conditions, highly resistant to efforts to make it over." Three 
factors, according to Schudson, limit our abilities to change the past: The 
structure of available pasts presents only some pasts and poses limits to the de- 
gree to which they can be changed, while placing other pasts beyond our per- 
ceptual reach; the structure of individual choice makes some pasts unavoidable 
and others impossible to face; and the structure of social conflict over the past 
means that we are not always the ones deciding which pasts to remember and 
which to forget. In his important study of Watergate in American memory as 
well, Schudson (1992) responds to the instrumentalist claim of infinite malle- 
ability by taking the limits on such manipulability into account. 

In contrast, Schwartz (1991, 1996) responds to the cultural claim of malle- 
ability: Certain pasts, while somewhat malleable, are remarkably persistent 
over time. Schuman & Scott (1989) and Middleton & Edwards (1990)
emphasize individual-level processes like generational experience and personal
identities, while Schwartz and others look at institutional factors. Schwartz
documents how certain meanings remain relevant over long periods of time despite
superficial changes in the reading of those meanings as well as in their institu- 
tional contexts; certain pasts are constitutive elements of political cultures, and 
these endure as long as the political culture is not completely superseded. Even 
when radically new pasts emerge, they often superimpose themselves over 
older versions without eliminating them. As Shils (1981) sums up a more ex- 
treme version of this argument, "traditional patterns of belief and conduct... 
are very insistent; they will not wholly release their grip on those who would 
suspend or abolish them." Shils also emphasizes that the persistence of the past
can be an explicit goal, as in self-conscious orthodoxies, thus mixing instru- 
mentalist and culturalist positions. 

A third aspect of memory persistence and malleability could be termed “in- 
ertial." Halbwachs discusses how memories become generalized over time 
into an “imago,” a generalized memory trace. Conservatives see this kind of 
change in memory as decay and seek ways to recuperate the lost past. Shils 
(1981) and Assmann (1992), among others, discuss pasts that remain the same 
simply out of the force of habit. Connerton's (1989) focus on memory "incor- 
porated” in bodily practices (as opposed to that “inscribed” in print, encyclo- 
pedias, indexes, etc) suggests this sort of inertia. Drawing on Elias's civilizing 
process and Bourdieu's work on consumption, he argues for a “mnemonics of 
the body." 

Table 1 summarizes this discussion by identifying six ideal types of
mnemonic malleability or persistence: 1. instrumental persistence: actors
intentionally seek to maintain a particular version of the past, as in orthodoxy or
movements to maintain or recover a past; 2. cultural persistence: a particular
past perseveres because it remains relevant for later cultural formations (more
general images are more likely to adapt to new contexts than more specific
ones); 3. inertial persistence: a particular past occurs when we reproduce a
version of the past by sheer force of habit; 4. instrumental change: we
intentionally change an image of the past for particular reasons in the present
(though we cannot always predict the results of our efforts); 5. cultural
change: a particular past no longer fits with present understandings or
otherwise loses relevance for the present; and 6. inertial change: the carriers of
particular images die, our mnemonic capabilities decay, or we simply forget.

Table 1

              Instrumental                Cultural                 Inertial

Persistence   Self-conscious orthodoxy,   Continued relevance,     Habit, routine,
              conservatism, heritage      canon                    repetition, custom
              movements

Change        Revisionism, memory         Irrelevance, paradigm    Decay, atrophy,
              entrepreneurship, redress   change, discovery of     saturation, accidental
              movements, legitimation,    new facts                loss, death
              invented tradition

One problem with instrumentalist and inertial accounts of change or persis- 
tence is that they locate the statics and dynamics of memory outside of the 
memories themselves. Even cultural approaches, while emphasizing mean- 
ings, seem to locate the source of change in political cultures, not in the textual 
dynamics of memory itself. To remedy this exogenous bias, Olick & Levy 
(1997) argue that whether a particular past persists or not depends partly on 
how it is constituted: Mythic logics produce taboos and duties while rational 
logics produce prohibitions and requirements; the former require bold acts of 
transgression to change them while the latter can be changed through argument 
and refutation. Olick (1997) also refines culturalist theories of mnemonic dy- 
namics by pointing out that cultural persistence or change is not merely a mat- 
ter of fit or lack of fit with context, nor of whether a particular memory is de- 
fensible as accurate or authentic: Memories form genres that unfold over time 
by referring not only to their contexts and to the "original" event, but to their 
own histories and memories as texts. 


Reputations and Knowledge 


Two empirical areas that have seen a great deal of work on the statics and dy- 
namics of memory are reputation studies and the sociology of knowledge. 
While the sociology of reputation is not an entirely new field—biographies 
have always dealt with image—it concentrates in an unprecedented way on 
how individuals are remembered rather than how they lived. Often these stud- 
ies begin by recognizing that reputations are only loosely correlated with life- 
time achievements; not only talent, but social factors play a role in securing 
and maintaining the outstanding reputations of individuals. 

Various authors, including especially Lang & Lang (1988) and Taylor 
(1996), appear to have converged on explanations in terms of four basic factors 
in reputational dynamics. First, personal strategizing and political maneuver- 
ing by the figure or his or her representatives can control the figure's image. 
Strategies include seclusion, autobiography, flamboyance, forging relations 
with patrons, etc. Lang & Lang (1988) note that in order to catch the eyes of 
dealers, collectors, curators, and art historians, artists have to produce a critical 
mass of work, keep adequate records to guarantee proper attribution, and make 
arrangements for custodianship. Institutional practices like record-keeping 
also favor some kinds of reputation for preservation over others. 

Second, image is influenced by those with a stake in a particular reputation. 
Latour (1988), for instance, argues that Pasteur's reputation spread as doctors 
and hygienists aligned themselves with the scientist's cause to promote their 
own professional interests. De Nora (1995) shows how Beethoven benefited
from admiration by an aristocratic musical public that was pivotal in shaping
the narrative of his genius. Schudson (1990) reveals how Ronald Reagan's 
popularity was constructed by an oral political culture in Washington, DC. 
Donoghue (1996) argues that in the eighteenth-century literary market, it was 
reviewers, and not the authors themselves, who were chiefly responsible for 
creating narratives of literary careers. 

Third, the impact of cultural factors on reputation is theorized in two dis- 
tinct ways. For those who view cultural patterns as distinct from talent, culture 
works to boost some reputations at the expense of others as a matter of happen- 
stance. Lang & Lang (1990), for instance, discuss the influence of ideology on 
reputation, noting that artists’ achievements are refracted “through their avail- 
ability as a symbolic form for a variety of sentiments that may have nothing to 
do directly with art.” In a different approach, cultural theorists (Bourdieu 
1984) who focus on the constructed nature of taste show how reputations de- 
pend on struggles for prestige and position that employ culture as a tool and as 
markers. De Nora (1995, p. 180), for instance, details how Beethoven’s pro- 
motion of a sturdier piano helped create new aesthetic categories within which 
his music “could make sense and be positively evaluated.” Reviews of Beetho- 
ven's work became more favorable as personal idiosyncrasies and creativity 
came to be valued in the music-critical discourse as a “higher” form of music. 
Similarly, Tuchman & Fortin (1984) show how women were “edged out” of 
the literary field. As men entered the field, the novel rose to high-culture status 
while the themes and styles of women’s writing were demoted to popular cul- 
ture. Zelizer (1992) shows how professional journalists used the Kennedy as- 
sassination and their eulogies of him to advance their own authoritative status. 

A fourth line of work on reputation shows how reputations respond to 
broader narrative and cultural forms. We have a tendency, theorists of
reputation argue, to exaggerate both greatness and evil. One of the earliest reputation
studies (Connelly 1977), for instance, demonstrates how the figure of Robert
E. Lee was invested with extraordinary import because, across many years, his 
image acted as a palimpsest on which contemporary concerns could be written 
and rewritten. Schwartz (1990) documents how Lincoln's image changed from 
one of simple accessibility to that of a “remote and dignified personage.” 
Schwartz (1991) also shows how Washington's reputational malleability is 
tied up with the changing needs of different periods in American history, while 
maintaining a common core of continuity. In his now classic study, Pelikan 
(1985) shows how the varying representation of Jesus reflected particular pre- 
occupations of different societies in different periods. 

From the other side, Ducharme & Fine (1995) show how villains (in their
case, Benedict Arnold) are remembered in a much worse light than their deeds
might warrant; Johnson (1995) discusses the rehabilitation of Richard Nixon. 
Additionally, Taylor (1996, p. 261) notes that “we are particularly prone to
remember stimuli associated with major changes in a niche.” It helps one's
reputation, Latour (1988) argues, to be associated with the dawn of a new era in a
particular field. Others have carried this line of argument even further, arguing 
that the very possibility of distinctive reputations is tied up with the career of 
the genius notion in the culture at large. Heinich (1996), for instance, inquires 
into the history of the category of talent in her study of The Glory of Van Gogh, 
as does Gamson (1994) in his study of the category of celebrity in American 
culture. 

Another empirical field where sociologists have studied the dynamics of 
memory is the sociology of science and knowledge. Research on scientific 
knowledge is concerned largely with the problem of forgetting, while investi- 
gations of canon formation ask why particular kinds of knowledge are remem- 
bered. Kuhn (1962) argues that knowledge depends on paradigmatic conven- 
tions: Normal science within paradigms cumulates, but knowledge in different 
(later) paradigms is incommensurable. Gans (1992) argues, however, that 
even within paradigms knowledge does not cumulate: Younger researchers re- 
peat findings already reported by earlier practitioners. Gans labels this process 
"sociological amnesia" and attributes it to institutional factors including aca- 
demia's reward structure, myths of scientific progress, and the lack of mecha- 
nisms for punishing unintentional borrowers. Gans is aware that Sorokin 
(1956) had already made the same point. Merton (1973) also documented how 
scientists tend to forget the origins of their ideas: Scientists are committed to 
an ideology of original discovery, “which is embedded in all the forms of insti- 
tutional life, along with prizes and naming of plants, animals, measurements, 
and even diseases after scientists." Good ideas, moreover, are the products of 
climates of opinion; it is thus often pointless to ask who said something first, as 
Merton demonstrates in his study of the expression, “on the shoulders of gi- 
ants" (1985 [1965]). 

Some works, figures, and ideas, however, tend to be singled out and pre- 
served as particularly important. Just as for reputation, one important factor is 
how closely associated with a major rupture a work or idea is, in Kuhn's terms, 
how close to a paradigm shift. As Levine (1995) notes, moreover, disciplines 
have collective memories that establish and maintain their identities. Douglas 
(1986) argues that a theory is more likely to be remembered if it shares basic 
formulae, equations, and rules of thumb with theories in other fields: "On the 
principle of cognitive coherence, a theory that is going to gain a permanent 
place in the public repertoire of what is known will need to interlock with other
kinds of theories." Tuchman & Fortin (1984), as already noted, show that these 
processes can be political: Ideas propagated by powerful groups and for pow- 
erful purposes are more likely to be remembered than others. Taylor (1996), 
among many others, documents the underlying political function of canons as 
well.

Efforts to revise established knowledge orthodoxies can be tied up with
overt political constellations and purposes as well. In the past decade, there has 
been a proliferation of "historians' disputes," public debates about both the 
content and meaning of history in several nations, including Germany (Maier
1988), France (Kaplan 1995), and Israel (Ram 1995). The rise of interest in 
memory, the challenges to the distinction between history and memory, and 
the status of memory in postmodern society reviewed in this essay are part of 
the explanation for these debates. It is interesting to note that the term
"revisionism" is of relatively recent vintage (Novick 1988); revisionism now is
taken to refer to those who deny taken-for-granted truths, like the occurrence
of a German genocide of Jews in the 1940s, though it originally meant any
attempt to challenge commonly held beliefs about the past, including the
"normal" growth of scientific knowledge. Studies of more extreme revisionisms
(Lipstadt 1993, Vidal-Naquet 1992) document both that history can serve as a
surrogate in more general political struggles and that particular images
of the past have symbolic import that extends beyond questions of their truth. 


Future Directions 


The field of social memory studies is clearly vast, the forms of memory work 
diverse. It should be clear, however, that similar themes occur in different dis- 
ciplinary, substantive, and geographic areas. Given the epochal character of 
memory demonstrated by the history of memory, this should not be surprising. 
As Valéry put it in our epigraph, the time is past in which time did not matter; 
we experience this condition as a problem of memory. In recent times, the so- 
lution has been to designate sites to stand in for lost authenticity, to proliferate 
new narratives when the old ones no longer satisfy, and to abbreviate—as 
here—in face of insurmountable accumulation. Social memory studies are 
therefore part of the phenomenon they seek to explain. But the explanation, we 
have tried to show, need not be relentlessly particular: The enterprise does 
have clear lineages just as the phenomenon has general contours, and explana-
tions of the various processes are transposable across cases (e.g. Germany and
the United States) and across issues (e.g. reputation, monuments, and knowl- 
edge). 

We conclude by pointing to four areas that emerge in social memory studies 
as possible future directions. First, social memory studies clearly fit with the
widespread interest in identity in recent social and sociological discourse. 
Memory is a central, if not the central, medium through which identities are 
constituted. Inquiries into identity and memory are being related; these re- 
search programs, we hope, will illuminate further how, when, and why indi- 
viduals and groups turn toward their pasts. 

Interestingly, both fields have attacked the tendency to reify their founda- 
tional concepts; both identity and memory, we now recognize, are ongoing 
processes, not possessions or properties. This leads directly to our second
point: Many sociologists (McDonald 1996) have recently argued that the basic 
categories of sociological analysis reify temporality. These critiques call for a 
"processual" or “narrative” approach to social processes, arguing that socio- 
logical strategies for approaching the past have heretofore been ahistorical. 
Appreciating the changing history of mnemonic practices as well as the ways 
in which these changing practices are the media of temporal experience can 
and should play a role in this search for a more genuinely historical sociology. 

A third point is more practical. As the belief that history and memory are 
epistemologically and ontologically distinct has eroded and as competing 
pasts and historical legitimacy claims have proliferated, the ability to settle 
conflicts over how to represent the past has also diminished. We have certainly 
gotten better at deconstructing identitarian mythology, but this has left us with 
a not-always-productive cacophony of claims vying for dominance. While the 
recent period of inquiry into the history and dynamics of social memory seems 
to have fed this deconstructive mood (and vice versa), we hope that further re- 
search will help us resolve some of the conflicts or at least manage them better. 

Our fourth point is connected to this: Until now, it seems that macrosocio-
logical theories of modernity and postmodernity have done well at explaining
memory as a dependent variable. But social memory is largely absent from our 
grandest theories. The diverse memory practices reviewed here are not merely 
symptoms of modernity and postmodernity—they are modernity and post- 
modernity. Sociological theorists, we argue, thus have a great deal to learn 
from theorists like Nora, Huyssen, and Koselleck. Recent work by Giddens 
(1990, 1994) has moved in this direction. More studies of the way memory 
practices are central features of modern and postmodern life and more theories 
of these epochal forms with memory at their heart should follow. In sum, all 
four of these points demonstrate that social memory studies is not a narrow 
subfield; it provides powerful lessons for sociology as a whole, is consonant 
with the reformation of historical sociology now occurring, and provides im- 
portant insights for theory at the broadest level. Sociology, we argue, cannot 
afford to forget memory. 


ABSTRACT


Divergent conceptualizations of the recent changes in work organization that
have accompanied computerization include neo-Bravermanian analyses,
postindustrial analyses, and contingency analyses. To make sense of these 
differing views, the paper surveys sociological research on computerization 
and its impact on three analytically separate dimensions of the workplace: 
organizational restructuring, changes in worker skill, and power and author-
ity relationships. The review reveals that computerized work organizations
typically have fewer hierarchical levels, a bifurcated workforce, frequently 
with race and sex segregation, a less formal structure, and diminished use of 
internal labor markets and reliance instead on external credentialing. Vari- 
able patterns of centralization and decentralization occur, and workplace 
power relationships interact with technological change to produce variable 
political outcomes. With regard to worker skills, recent evidence suggests 
aggregate upskilling with some deskilling and skill bifurcation. Future re- 
search should more closely analyze the process of technological design and 
implementation. 


INTRODUCTION 


During the latter part of the twentieth century, the implementation of comput- 
erized technology and advanced information systems, in conjunction with re- 
lated socioeconomic changes, has led to a fundamental restructuring of work 
organizations. Contemporary sociologists, trying to understand this "second 
industrial divide" (Piore & Sabel 1984), as nineteenth century sociologists 
tried to understand the first industrial revolution, have generated widely diver-
gent conceptualizations. On the one hand, some social scientists have found
that recent social and technological changes have created a centralized, neo- 
Taylorist work organization, deskilling of the labor process, and reduced 
worker autonomy (Braverman 1974, Feldberg & Glenn 1987, Kraft 1977, 
1979, Noble 1977, 1984, Shaiken 1984, Zimbalist 1979). Conversely, other 
social scientists have concluded that the transformation of production has pro- 
moted a postindustrial or postbureaucratic work organization characterized by 
decentralization and reduction in hierarchy, upskilling of work and a centrality 
of knowledge workers, and democratization and increased worker autonomy 
(Attewell 1992, Bell 1973, Block 1990, Clegg 1990, Hirschhorn 1984, Piore & 
Sabel 1984). Still others have abandoned the search for general theory con- 
cerning the impact of technological change on the organization of production 
in favor of "contextualist" or "contingent" approaches that explore the micro-
dynamics of workplace changes (Adler 1992b, Barley 1986, Cornfield 1987,
Gallie 1978, Kelley 1990, Thomas 1994). These "contingency theorists" have 
argued that "the quest for general trends about the development of skill levels, 
or general conclusions about the impact of technologies, is likely to be in vain
and misleading" (Wood 1989, p. 4; see also Vallas & Beck 1996, p. 341 ff for a
good review). 

We now have a considerable body of empirical research to help make sense 
of these divergent views. Can we substantiate any empirical or theoretical gen- 
eralization concerning the impact of advanced technology on the workplace? 
If recent changes in the organization of production are contingent, can we be- 
gin to specify the contingencies? To answer these questions, this paper reviews 
recent sociological work on computerization and its impact on each of three 
analytically separate (although practically intertwined) dimensions of the 
workplace: organizational restructuring, changes in worker skill, and power 
and authority relationships. Although computerization is a global phenome- 
non, space constraints necessitate an emphasis here on US workplaces. I con- 
clude with an analysis and interpretation. 


ORGANIZATIONAL RESTRUCTURING 


If it is true, as Salzman & Rosenthal (1994, p. 4) contend, that “workplaces are 
shaped by the design of the technology used,” then we would expect work or- 
ganizations centered around computerized systems of production and informa- 
tion to differ structurally from those utilizing other technologies. However, 
computerized systems are more flexible and variable than previous types of 
workplace technology; it is therefore not surprising to find variable patterns of 
implementation. Nonetheless, some trends are observable in the empirical lit- 
erature. 


Reduction in the Hierarchical Division of Labor 


In contrast to the specialized division of labor with fine vertical gradations, 
characteristic of classical bureaucracy and many production workplaces, some 
researchers have found that computerization has correlated with fewer hierar- 
chical levels and a "two-tier" occupational structure (Baran 1987, Colclough 
& Tolbert 1992, Kanter 1983, 1984, 1991, Hodson 1985, 1988, Noyelle 1987, 
Smith 1993, 1996, Wellman et al 1996, Zuboff 1988). Middle-level positions 
are reduced or eliminated, with a credential barrier typically separating the two 
sectors of the polarized corporation, with emphasis on external credentialing 
and recruitment from without (Kanter 1984, 1991, Burris 1983a,b, 1993). This 
trend appears particularly pronounced in high-tech firms (Kanter 1983, Hod- 
son 1988). However, some observers disagree about whether this reduction in 
vertical hierarchy is causally related to computerization; Kling (1996a, p. 282) 
finds that what he terms “delayering” is also found in low-tech organizations 
and is more related to broader cultural and political changes than to computeri- 
zation per se. 

It appears that the extent and shape of the restructuring of the division of 
labor are dependent on several factors: the specific type of technology, 
managerial policies and choices, and the nature of the service or product (Salz- 
man & Rosenthal 1994). When production of goods or services can be stan- 
dardized and performed largely by the computer system, the workforce is 
likely to be more bifurcated into skilled technical workers and a smaller 
number of less skilled production or clerical workers, whereas when comput- 
erization is less extensive, the polarization may be less pronounced (Baran 
1987, Barley & Orr 1997). With even more extensive computerization, “su- 
perautomation," comes a dramatic reduction in the size of the production 
workforce (Indergaard & Cushion 1987, Office of Technology Assessment 
1984, Shaiken 1984). 

One corollary associated with this polarization is that the organizational 
structure becomes less formal. In contrast to conventional bureaucracies, with 
their clearly defined chains of responsibility and communication channels, re- 
structured bureaucracies rely more on ad hoc teams and task forces (Hodson 
1988, Kanter 1983). What has been called an “adhocracy” (Mintzberg 1979) or 
"matrix organization" (Kanter 1983) emerges in some contexts: organic, inte- 
grative, flexible, adaptive, and innovative workplaces with a constantly chang- 
ing internal structure. For expert-sector workers (managers and professionals), 
at least, bureaucratic constraints are relaxed to allow for creativity and flexibil- 
ity (see Burris 1993). Indeed, occupational segregation appears sometimes to 
be accompanied by a pronounced bifurcation of working conditions. Hodson 
(1988) found that worker autonomy, input into decision-making, and salaries 
were dramatically different for high-tech engineers and workers. 

A second corollary is that conventional internal labor markets tend to erode,
along with mobility prospects for non-expert sector workers (Baran 1987, 
Hodson 1988, Kanter 1991, Noyelle 1987). External credentialing and promo- 
tion from without frequently substitute for in-house training and promotion. In 
some firms, diminished mobility prospects have translated into impaired 
worker motivation (Burris 1983a,b, Hodson 1988, Noyelle 1987); some firms 
have experimented with quality control circles and other worker participation 
experiments to compensate for the lack of training and mobility opportunities 
(see Noyelle 1987). 

A final corollary of polarization has been the reinforcement of race and sex 
segregation in some firms (Burris 1989, Cockburn 1985, Colclough & Tolbert 
1992, Hodson 1988, Noyelle 1987, Smith 1993). The erosion of internal labor
markets has had special implications for women and racial minorities, as the 
organizational restructuring occurred during the 1970s and 1980s, a period of 
equal employment legislation and expanding social opportunities. The effect 
was to thwart legal and social reforms at the organizational level: 


...at the same time that EEO policies were gaining speed, other forces came 
into play that began weakening the role of internal labor markets across a 
broad range of industries. Hence a basic dimension of EEO strategy—ag- 
gressive internal promotion of women and minority workers—was under- 
mined. Some women and minority workers continued to advance to higher 
echelons, but their progress became increasingly dependent upon a different 
set of factors, involving educational credentials. (Noyelle 1987, p. 15-16) 


Quantitative analyses have confirmed this race and sex segregation (Col-
clough & Tolbert 1992, Glenn & Tolbert 1987, Kraft 1987, Mahung 1984,
Strober & Arnold 1987). More qualitative analyses have also documented the 
persistence of gender stereotypes that define femininity as antithetical to tech- 
nical expertise (Cockburn 1985, 1991, Hacker 1989). However, this pattern of 
gender segregation may be changing; Wright & Jacobs (1994) found that com- 
puter support occupations (i.e. jobs that support other people's use of com- 
puter systems—computer programmers, systems analysts, computer and sys- 
tems engineers) became less gender segregated during the 1980s, with all com- 
puter support jobs being 36% female by 1991. 


Patterns of Centralization and Decentralization 


Traditionally, workplaces have been kept highly centralized, but with comput- 
erization come opportunities for new patterns of centralization and decentrali- 
zation. Computerized numerical control (CNC), for example, can be used to 
facilitate end-user programming and editing (Noble 1984, Shaiken 1984), and 
personal computers can serve to link relatively autonomous satellite stations or 
work teams (Kanter 1991, Murphree 1984). However, the same technology 
can be used to maintain centralized managerial control and even surveillance
(see Office of Technology Assessment 1984, National Research Council 
1986). In some contexts, the computer system may assume the form of visible, 
functional decentralization (e.g. a computer terminal in every office or 
throughout the shop floor) but with an underlying centralization of control (see 
Burris 1993, Prechel 1994). 

Variable patterns of centralization and decentralization have been docu- 
mented, as social and political choices interact with technical considerations 
and system design (Burris 1993, Kling 1996a, Noyelle 1987). Kling (1996a, p. 
295-96) discusses two divergent approaches to system design and implemen-
tation: "business process reengineering [which] is usually applied by manag-
ers and consultants to streamline operations" and increase efficiency, and
“sociotechnical systems design," which emphasizes people and their relation-
ships with each other and the technology. The former typically implies more
centralized control, what Clement (1996) calls a “command and control cul-
ture," and the latter a more decentralized pattern, and one where end-users
sometimes play a substantial role in redesigning their work practices (Kling 
1996a, p. 299, Clement 1996). Despite the variable patterns of centralization, 
however, there is also evidence that traditional centralized patterns are the 
norm. The National Research Council (1986, p. 150) concludes from its survey 
of computer automation in diverse white-collar settings that “because innova- 
tions can be implemented in broadly different ways, the major determinant of 
the effects of innovation appears to be management's preexisting employee 
policies." 

Some workplaces have experimented with geographical decentralization in 
the form of “telecommuting” or “telework.” Although currently limited in 
scope, these experiments are significant and may become more prevalent in 
the future (Kling 1996a, Wellman et al 1996). A recent report by the Clinton 
administration (IITF 1994) cites numerous potential benefits from an expan- 
sion of telework: reduced automobile pollution and traffic congestion, im- 
proved quality of work life, smoother integration of work and family life. 

Although research on telework has been limited, some evidence suggests 
that current telework experiments may also result in teleworkers being less 
visible to peers, less likely to be promoted, and more difficult to supervise 
(Forester 1989). Kling (1996a, p. 288) also points out that some employees 
may lack the self-discipline required to work at home amidst home-based dis- 
tractions. Olson (1989) in a study of computer professionals working at home 
full-time found reduced job satisfaction and organizational commitment and 
higher levels of role conflict. Olson & Primps (1984) found that female tele- 
workers were particularly likely to assume greater housework and childcare 
responsibilities and to experience stress deriving from work/family conflicts. 
Wellman et al (1996) argue that telework may exacerbate workplace bifurca- 
tion, for some studies have found that professional teleworkers tend to benefit
from computerized social networks and expanded autonomy, whereas clerical 
teleworkers tend to become more isolated and often more stringently moni- 
tored (Olson & Primps 1984). 

Telework highlights issues of centralization and control that are also perti- 
nent to computerized workplaces more generally. Computerized systems of 
production are flexible, although not neutral, and the design of the software is 
critical. Salzman & Rosenthal (1994, p. 6) in their analysis of software produc- 
tion show in some detail how “technology both shapes and reflects the social 
matrix of organizations and socioeconomic systems, of which it is a part" and
how a new technology can be either “assimilated” into existing organizational 
structures or “accommodated” by restructuring the organization (1994, p. 23; 
see also MacKenzie & Wajcman 1985, Thomas 1994). Although there has 
been a tendency to design and implement computerized technology in a man- 
ner consistent with centralized control, this is not inevitable and in fact may 
lead to organizational contradictions, worker dissatisfaction, and further 
change (Hirschhorn 1984, Kling 1996b, Noble 1984, Vallas 1990). 


CHANGES IN WORKER SKILL 


Braverman and the Labor Process Tradition 


Braverman's (1974) analysis of Taylorism and the capitalist labor process has 
been influential for over 20 years, despite some trenchant criticism (Attewell 
1987, Friedman 1977, Stark 1980). Braverman analyzed the ways in which 
scientific management, under the guise of scientific neutrality and objectivity, 
promoted the interests of capitalist managers (1974, p. 86). Taylor (1913, p. 
25) contended that “there is always one method ... which is quicker and better 
than any of the rest. And this one best method ... can only be discovered or de- 
veloped through a scientific study and analysis.” Braverman showed how Tay- 
lorism was implemented so as to deskill the labor process, separating concep- 
tion from execution and transferring conceptual skills to technical experts and 
managers (see Burris in press for fuller discussion). 

Braverman discusses computerization, a trend far from widespread in 1974,
only in passing and in fact appears to see the logic of computerization as anti-
thetic to the logic of Taylorism and capitalism, due to its potential to “re-unify”
the labor process (see Braverman 1974, p. 328). However, neo-Marxist soci- 
ologists have often found that capitalist social relations tend to shape techno- 
logical design so as to make computerization consistent with capitalist and 
managerial imperatives. Noble (1977, 1979, 1984), for instance, showed how 
numerical control of machining was chosen and implemented in accordance
with capitalist and militarist imperatives toward centralized control (see also 
Shaiken 1984). 

Other empirical studies in the labor-process tradition have also found that
deskilling of workers results from computerization of production workplaces. 
Cockburn (1985), for instance, found deskilling, increased managerial control, 
and race and sex polarization in her study of the garment industry in England. 
The case studies in Zimbalist (1979) reveal evidence of deskilling in industries 
ranging from carpentry to coal mining. 


Upskilling and Sociotechnical Design Perspectives 


As Attewell (1987, 1992; see also Vallas 1990, 1993) points out, however, a 
contradiction exists between these qualitative case studies, with their evidence 
of deskilling, and more aggregate, quantitative analyses of the labor force, 
which have tended to find substantial upgrading due to an expansion of more 
skilled occupations (Adler 1988, Baran 1987, Spenner 1983). As Barley & Orr 
(1997, p. 3) demonstrate, since 1950 the fastest growing occupational group 
has been professional/technical workers, who comprised 17% of all workers in 
1991. Wright & Singlemann (1982) and Barley & Orr (1997) discuss how 
overall occupational upgrading can coexist with deskilling of specific occupa- 
tions, as the production workforce becomes bifurcated into skilled technicians 
and less skilled operatives. 

In contrast to the neo-Bravermanian labor process literature, then, other 
empirical studies have found skill upgrading, “upskilling,” to be correlated 
with computerization. The classic study in this tradition is Blauner's (1964)
comparison of automation with earlier types of technology, in which he found 
that continuous-process operators were upskilled, had more opportunities to 
learn and grow on the job, were less isolated and often worked in teams, and in 
general were less alienated than workers in industries with less advanced tech- 
nology. 

The sociotechnical conception of work design (Trist et al 1963) emphasizes 
that the social and technological dimensions of work organizations must be de- 
signed to complement one another and that computerized production systems 
are capable of being designed so as to expand worker skill and autonomy. 
Sociotechnical analysts assume that advanced technology, expensive and vul- 
nerable to technical problems, leads to a heightened dependence on operators 
to ensure productivity, cost-effectiveness, and quality control; therefore, to re- 
alize the potential of the technology, decentralization and teams of multi- 
skilled workers who understand the total operation of the plant are needed. 
Hirschhorn (1984) analyzed the interaction between technology and the social 
organization of production in firms such as Olivetti, Fiat, and General Foods, 
and his findings are consistent with sociotechnical analysis: When work was 
organized around self-governing worker teams and worker learning, the result 
was better quality products and superior market position, whereas when 
worker deskilling and alienation were the norm, workers were unable to effec-
tively monitor and diagnose the complex technological system, making the op-
eration vulnerable to technical breakdown and lost productivity. Hirschhorn &
Mokray (1992, p. 16), in their study of a computer manufacturing plant, show 
how worker "[c]ompetence is shaped through the interaction of a worker's 
skills with the role he or she performs," making skill upgrading and worker 
autonomy necessary for optimal production. 

Zuboff's (1988) case studies of the computerization of diverse workplaces 
are also consistent with the sociotechnical perspective. For Zuboff, computeriza-
tion fundamentally transforms skill by making work more "abstract" (see also
Barley & Orr 1997, p. 5 ff, Hirschhorn & Mokray 1992, p. 23). Whereas earlier 
types of production involved manual skills and the interpretation of visual 
cues, working with computers involves "the electronic manipulation of sym-
bols. Instead of a sensual activity, it is an abstract one" (Zuboff 1982, p. 145).

For Zuboff (1988, p. 9 ff), computerized technology can be implemented so 
as to either “automate” or “informate” jobs. When the informating strategy is 
chosen, information about the overall operation of the system is more available
to workers, who are therefore able to learn and develop new skills and compre- 
hensive understanding. In practice, however, Zuboff (1988, p. 252) found that 
such technological potential is often thwarted by managerial reluctance to 
share information and power: “Managers perceive workers who have informa-
tion as a threat. They are afraid of not being the ‘expert’” (see also Kanter
1983, Noble 1984). 

Zuboff's work indicates general upskilling of production work, with the re- 
definition of jobs around more abstract skill, but limited expansion of worker 
autonomy. Similarly, Vallas & Beck (1996), in their study of pulp and paper
mills, found significant upskilling of the jobs of manual workers as they 
learned the computerized system, but no evidence of any expansion of worker 
autonomy or discretion. Instead, they found a persistence of centralized mana- 
gerial control in conjunction with increased reliance on degreed engineers as 
supervisors. Barley & Orr (1997, p. 19) found that the “emergent skills” of 
technical workers are often neither recognized nor rewarded; Creighton & 
Hodson (1997) found that technical workers in diverse settings were skilled 
but lacked power and autonomy. Vallas (1993, p. 184) in his study of AT&T 
found that “while the use of automated systems has at times increased skill re- 
quirements, its overall effects on levels of worker autonomy or responsibility 
have been far less beneficial...”; Iacono & Kling (1996) found that although 
“dramatic improvements in office technologies ... have sometimes made 
many clerical jobs much more interesting, flexible, and skill rich ... these 
changes, especially those involving increased skill requirements, have not 
brought commensurate improvements in career opportunities, influence, or 
clerical salaries” (Kling 1996a, p. 283). 

The empirical literature on computerization and skill, then, is somewhat in-
consistent and contradictory. Some have preferred to speak of “skill disrup-
tion" (Hodson 1988) or "skill restructuring" (Cockburn 1983), thus remaining
agnostic on the deskilling vs. upskilling controversy. The more recent empiri- 
cal work reveals aggregate upskilling, sometimes combined with the deskill- 
ing of a small number of jobs (Adler 1992a, Attewell 1992). At least three 
trends seem pertinent in explaining the increased salience of upskilling: 1. 
Computerized systems have become more sophisticated, with the develop- 
ment of advanced manufacturing technology and increased reliance on more 
skilled workers (Attewell 1992, p. 70; see also Hirschhorn & Mokray 1992). 2. 
The least skilled jobs have been disproportionately eliminated by the technol- 
ogy (Aronowitz & DiFazio 1994, Rifkin 1995). 3. More managers may be 
choosing to supplement their computer system with skilled workers to maxi- 
mize productivity (reduce downtime) and quality control (Adler 1992b). 


Professional Work 


In recent decades professional work has also been transformed as the ideal 
type of the autonomous, self-employed professional has become the exception 
rather than the rule. As with other types of work, both empirical findings and 
conceptualizations of these changes have varied. 

Some social scientists have concluded that “deprofessionalization” or “pro- 
letarization of the professions” is occurring (Derber 1982, Haug 1973, 1975, 
1977, Larson 1977, Rothman 1984). Haug (1973, 1975, 1977), for instance, ar- 
gues that such developments as rising educational levels among the general 
population, computerization and greater availability of knowledge, and a 
growing societal consciousness of the need for professional accountability 
have contributed to an undermining of professional power and increasing reli- 
ance on paraprofessionals. Larson (1977) focuses more on the trend toward 
professional employment in large bureaucratic organizations and on corporate 
pressures to maximize profits as leading to a more rigid division of labor, 
larger professional caseloads, and the routinization and standardization of pro- 
fessional work. She concludes that “technobureaucratic professionalism” re-
sults, as “professional status ... no longer insures the incumbent against the
predominant relations of production in our society” (Larson 1977, p. 233). 
Derber (1982, p. 21) highlights another dimension of proletarization: profes- 
sionals’ increased difficulty in owning and controlling their own means of pro- 
duction, making them more dependent on large institutions for their survival. 

Others have interpreted recent changes in professional work as more con- 
sistent with Weberian bureaucratization, rationalization, and formalization. 
Freidson (1984, 1986), for instance, argues that conceptualizations such as de- 
professionalization and proletarization are exaggerated in that professionals 
continue to enjoy high status, prestige, and occupational power (see also Der-
ber et al 1990). Computerization may not adversely affect professionals, since
"it is the members of each profession who determine what is to be stored and
how it is to be done, and who are equipped to interpret and employ what is re- 
trieved effectively" (Freidson 1984, p. 8). Freidson sees the professions as be- 
coming “formalized,” as a certain internal stratification into an administrative 
elite, a knowledge elite, and rank-and-file professionals occurs (Freidson 
1986). 

Abbott (1988) also focuses upon the increasing systemization and ration- 
alization of the professions and the ongoing redefinition of professional cate- 
gories and jurisdictional boundaries within the system of professions in recent 
years owing to technological and organizational changes. Like the deprofes- 
sionalization school, Abbott also highlights the “commodification of knowl- 
edge," the competition from computerized diagnostic systems, and the ensuing 
routinization and degradation of some professions (Abbott 1988, p. 126 ff). 

Certainly we have seen the creation of many paraprofessional occupations 
in the last few decades, for instance, in the health care field (McKinlay 1982). 
Medical diagnostic systems have also been developed, although not widely 
implemented (Dreyfus & Dreyfus 1986). However, physicians have retained 
considerable occupational power and workplace autonomy. In fact, some evi- 
dence suggests that computerized technology may be enhancing, rather than 
undermining, the work of the physician (Burris 1993, National Research 
Council 1986, p. 159). 

Similarly, the legal profession and the judiciary have undergone internal 
stratification and systemization (Heydebrand & Seron 1990, Spangler & Leh- 
man 1982). Heydebrand (1979) speaks of a "technocratic restructuring” of the 
judiciary in response to a crisis of the judicial system, with increased reliance 
on computerized data banks, role integration of professional and administra- 
tive functions, and more circumscribed and specialized judicial discretion. 
Others have found that judicial discretion has been enhanced and expanded,
even as the judiciary has been reorganized and systematized (Aaronson 1977, 
Freidson 1984). 

Some researchers have found polarization of certain professions. Shaiken 
(1984), Kraft (1987), and Kunda (1992) all found polarization within the ranks 
of computer professionals and engineers. Kunda (1992) documents a division 
between “central” and “marginal” engineers, with cultural normative control 
characteristic of the central, exempt engineers and more coercive and utilitarian
control characteristic of the nonexempt sector.

In sum, technological changes and other rationalization
measures appear to have differential effects on professional work, depending
on the relative status of the profession and of the professional within a given 
profession. Professionals within elite professions may lose a certain degree of
autonomy as they become more integrated into complex administrative systems,
but discretion over professional work is generally retained (or even expanded).
Ideologically and economically, they may be more subject to capitalist
and bureaucratic imperatives, but not deprofessionalized. Less elite professionals
and paraprofessionals (health care technicians, nurses, teachers, computer
programmers) may be more vulnerable to professional rationalization. In
some instances they may be deprofessionalized: their work deskilled, their 
caseloads increased, their contact with clients routinized. For such workers, 
professionalism may be little more than a legitimating ideology (see Burris 
1993, p. 142 ff). 


POWER AND AUTHORITY RELATIONSHIPS 


Increased Salience of Technical Expertise 


With the increased salience of professional and technical workers, it appears that 
traditional rank authority is deemphasized in favor of technical expertise (Burris 
1993, Ilchman 1969). Kanter (1983, p. 55 ff), for instance, found that in the high-tech
firms she studied "traditional authority virtually disappears; managers
must instead persuade, influence, or convince" (see also Hodson 1988, Kunda 
1992). Burris (1983a, 1993, see also Zuboff 1988) found that "conspicuous expertise"
and new forms of politicking, centered around expertise, emerge.

Knowledge and information become important sources of power. Zuboff 
(1988) found that computer conferencing, along with computerized informa- 
tion generally, was restricted to expert-sector workers through the use of pass- 
words, account numbers, and “closed status" designations. Kraft (1977) found 
that conflicts over the locus of authority occurred between experts and manag- 
ers, and that these were exacerbated when managers had limited technical 
knowledge; similarly, Hirschhorn (1984) found that managers of utility com- 
panies were sometimes threatened by engineers' expertise and perceived 
threats to managerial authority. In other contexts, experts and managers have 
formed unified coalitions, and conflicts between workers and technical experts 
have been more common (Burris 1993, Zuboff 1988). 


Worker Autonomy or More Stringent Managerial Control? 


As we have seen, postindustrial analysts have argued that the fundamental 
logic of advanced technology is most consistent with increased worker auton- 
omy and democratization (Hirschhorn 1984, Piore & Sable 1984, Zuboff 
1988). Conversely, some have found that managerial control has been intensi- 
fied and extended by computerized systems (Applebaum & Albin 1989, Pre- 
chel 1994). 

Worker participation experiments have been implemented in many workplaces
(Applebaum & Batt 1994, Smith 1996, US Department of Labor 1994);
more ambiguous is the significance of these experiments and whether they are
causally related to technological change. Some have found the level of worker 
autonomy and input into decision-making to be minimal, and the significance 
of the experiments to be largely ideological, a form of “hegemonic” or “con- 
sensual” control (Burawoy 1979, 1985, Hodson et al 1993, Vallas 1993). 
Smith (1996), however, argues that this interpretation, although not without 
validity, misses the fact that the low-level white-collar workers that she stud- 
ied perceived the new system of worker participation to be personally benefi- 
cial, promoting the acquisition of social/relational skills and “a new step on a 
constrained mobility ladder" (Smith 1996, p. 177; see also Hodson et al 1993). 
Kling (1996a, p. 299) also highlights examples of more significant types of 
worker participation, where workers reorganized the way in which computeri- 
zation was implemented so as to create more flexible and less regimented jobs 
for themselves. 

Computerization promotes not only production but also social networking 
(see Wellman et al 1996 for a good review). Office computers can be used for 
recreation, private "conversation" with other workers, nonwork friends, or 
family members, and work-related interaction. In some workplaces, computer- 
mediated communication, with its diminished social presence and greater ano- 
nymity, can be used to cross status and power boundaries, promoting a more 
democratic type of workplace interaction and culture (Wellman et al 1996, 
Zuboff 1988). For relatively autonomous computer conferencing and collabo- 
rative work, now potentially global in scope, computerization can greatly en- 
hance the work process; as Wellman et al (1996) point out, this computerized 
augmentation of collaboration is more salient among professionals and aca- 
demics. 

Computerization is also consistent with more sophisticated and intensified 
systems of managerial control. Zuboff (1988), for instance, found that comput- 
erization can be implemented so as to promote centralized managerial control 
by making workers more visible and vulnerable to supervision, a technologi- 
cally advanced version of the “Panopticon.” Others have addressed the issue of 
computerized monitoring of workers, although there is little consensus about 
the extent of this practice (Attewell 1987, Kling 1996a, p. 286 ff, Garson 1988, 
Marx 1996). Management sometimes monitors not only level of productivity 
and errors, but also the type of on-line activities to ensure "appropriate" use of 
the technology (Orlikowski et al 1995, Zuboff 1988). 

Prechel (1994) found that the large steel corporation he studied, in response 
to global competition and economic crisis, has implemented a “neo-Fordist” 
strategy of “hypercentralization,” “hyperquantification,” and “formalized 
control” (Prechel 1994, p. 737 ff), a system that reduced or eliminated the
former autonomy of managers and instituted a sophisticated neo-Taylorist system
of production control where the “one best way” of doing something is consistently
utilized. Prechel's case study demonstrates how a firm can flatten its
hierarchy, utilize the computerized system to achieve functional decentraliza- 
tion and flexibility, yet also strengthen centralized managerial authority over 
production and decision-making. 

Another recent conceptualization of changes in managerial control systems 
is that of "algorithmic" control (Applebaum & Albin 1989, Vallas 1993), 
which Applebaum & Albin (1989, p. 252) define as the reduction of “decision- 
making as much as possible to a set of self-contained rules (algorithms) imple- 
mentable by a computer." Vallas (1993), in his study of AT&T, shows how 
management, through algorithmic control, can simultaneously upgrade 
worker skills and extend managerial control over production by "placing infor- 
mation systems at the directive nodes of the productive circuitry and progres- 
sively removing workers to more peripheral locations in the labor process... 
[so that (in the words of one manager)] 'there are no decisions to be made [by 
workers]’” (Vallas 1993, p. 187). 

Finally, Burris (1993) highlights a neo-Taylorist “technocratic” ideology that 
rests on the increasing centrality of technical expertise and the assumption that 
technical and system imperatives have displaced traditional workplace politics: 
that there is one best technical solution to any problem, which can be found only 
by technical experts. This ideology, in conjunction with the mystique of com- 
puters and a sense of technological determinism and progress, serves to legiti- 
mate the power and privilege of technical experts, to obscure existing workplace 
politics, and to promote consensual control of workers (see also Collins 1979). 


DISCUSSION AND CONCLUSIONS 


Although the trends discussed above appear to correlate with computerization, 
causal inference is made problematic by the fact that computerization is embedded
in a constellation of factors: the internationalization of the division of labor,
intensified worldwide competition and a corresponding emphasis on innova- 
tion, expanded need (and capacity) to manage complex organizations and sys- 
tems and to perfect long-range planning. Technology interacts with social 
preferences and political choices in complex ways, making generalization difficult
(Thomas 1994). The challenge is to assess the impact of computerization,
avoiding both the Scylla of technological determinism and the Charybdis
of technological indeterminism. 

While there are many contingencies concerning computerization, there are 
also observable trends in the empirical literature. This review has revealed the 
need to separate analytically the three dimensions of the workplace examined 
here: organizational restructuring, worker skill, and power and authority relationships.
One pitfall of existing theories has been the tendency to assume correspondence
among these dimensions. Thus, postindustrial analyses have
tended to assume flatter hierarchies, a more skilled workforce, and worker empowerment
to be coterminous. Neo-Bravermanian analyses, on the other hand,
have tended to assume that more stringent centralization, worker deskilling, 
and reduced worker autonomy coexist and reinforce one another. 

Recent empirical work indicates, however, that these dimensions do not al- 
ways line up. We have seen, for instance, that technically skilled workers often 
do not enjoy autonomy or responsibility on the job (Baran 1987, Barley & Orr 
1997, Creighton & Hodson 1997, Iacono & Kling 1996, Vallas & Beck 1996, 
Zuboff 1988). A reduction in levels of hierarchy does not necessarily imply a 
more egalitarian workplace; delayering appears more often to result in a polar- 
ized, “two-tier” workplace and dramatically unequal working conditions be- 
tween the two sectors (Baran 1987, Colclough & Tolbert 1992, Kanter 1983, 
1984, 1991, Hodson 1985, 1988, Noyelle 1987, Smith 1993, 1996, Zuboff 
1988). In some workplaces, centralized decision-making, embedded in the 
computer system, can be combined with a considerable degree of functional 
decentralization and flexibility (Burris 1993, Prechel 1994). With regard to the 
deskilling/upskilling controversy, it has become apparent that both trends can 
coexist, depending on the unit of analysis: aggregate upskilling with some skill 
bifurcation, some deskilling, and considerable skill disruption (Adler 1992, 
Attewell 1992, Barley & Orr 1997).

Rather than abandoning the search for general theories about workplace 
change, we need to search for more complex and nuanced theory, multidimen- 
sional and multilevel theory, to understand both the generality and the contin- 
gency of contemporary workplaces. The new types of work and organization 
emerging along with computerization do not readily conform to existing theo- 
ries, but this does not imply that they cannot be theorized. 

In addition to more relevant theory, we also need more objective research. 
As a subdiscipline, the field of the sociology of work has tended to be highly 
politicized, with some researchers influenced by managerial perspectives and 
others by solidarity with workers (see Abbott 1993). In researching computerization,
some have taken the situation of expert-sector, professional workers to
be generally representative of working with computers and have therefore em- 
phasized the positive side of computerization; conversely, others have focused 
upon the situation of non-expert sector workers and have therefore emphasized 
the negative side of computerization (see Burris 1993). Each perspective is 
partially valid. One salient contingency in assessing the impact of computerization
on contemporary workplaces is one's position within an increasingly bifurcated
workforce.

Future research in this area needs to analyze more closely the process of 
technological design and implementation within work organizations so as to 
better specify the interaction between existing power relationships and computerized
systems (Salzman & Rosenthal 1994, Thomas 1994). We need to understand
better not only the impact of computerization on work organizations,
but also the impact of work organization on computerization. Only with this
more comprehensive understanding can we promote intelligent choices about
the workplaces of the future.


ABSTRACT


By 1996, 66% of the countries of the world were using elections to choose 
their top leaders. This wave of democratization was accompanied by a para- 
digm shift that took the large number of historically clustered democratiza- 
tions and called it a “wave.” The scholarship has moved beyond overly epi- 
sodic, event-oriented accounts of democratization to comparative work that 
investigates the impact of global processes on the political regimes of na- 
tions. This review examines numerous renderings of the linkage between 
globalization and democratization, including: favorable climate for democ- 
racy, global economic growth, global crises, foreign intervention, he- 
gemonic shifts, and world-system contraction. Those authors who have ad- 
vanced a stronger theoretical integration of the global and domestic pro- 
cesses offer exceptional insight into the momentous shifts that recently have 
occurred. 


INTRODUCTION 


In the last several decades, the world has experienced a democratic revival. In 
1974, only 39 countries (25% of the world's independent nations) were demo- 
cratic. By 1996, 66% were using elections to choose their top leaders (Wall 
Street Journal, June 25, 1996, p. 1). Dismantling totalitarian regimes and re- 
placing them with democratic ones are momentous societal transformations. 
The new democracies were celebrated in a rich and diverse literature that addressed
the antecedents and causes of the democratic transitions. Democratization
in Greece, Spain, and Portugal begot a scholarship that focused on the
historical and cultural distinctiveness of the respective cases. A few authors, 
observing the "historical clustering," offered an analysis of Europe's southern 
rim. Then, in the 1980s, several Latin American countries embarked on the 
transition to democracy. A new "transitions" literature connected these 1980s
events to those of the 1970s. When that set of democratizations was joined (in
the late 1980s and 1990s) by South Korea, Taiwan, Eastern Europe, and even
South Africa, the “Third Wave" literature was born. 


A wave of democratization is a group of transitions from nondemocratic to 
democratic regimes that occur within a specified period of time and that significantly
outnumber transitions in the opposite direction during that period
of time. A wave also usually involves liberalization or partial democratiza- 
tion in political systems that do not become fully democratic. (Huntington 
1991, p. 15) 


What has come to be known as the Third Wave heralded a paradigm shift in 
the contemporary scholarship. The innovation consisted in conceptually as- 
sembling these geographically and temporally dispersed events into a “wave.” 
It signaled a radical departure from the country-specific idiographic work 
which offered explanations that were overly episodic and event-oriented and 
insufficiently structural or cyclical (Wallerstein 1991, p. 1). The notion of a 
"wave" compels us to consider theoretical and methodological approaches that 
are comparative or even global. If observers of the 1970s transitions could 
comfortably reject internationally oriented theories, later scholars were de- 
prived of this luxury as the number of transitions grew, and as they moved 
from one continent to another. Once established, the wave concept enveloped 
all transitions. 

Some writers were critical of this, warning that the wave is more like a hurricane
that sweeps away anything in its path. Bunce argues that "a mere three
years after the collapse of state socialism...eastern Europe was in virtually all 
accounts in ‘transition to democracy.’ Few paused to ask whether this was the 
best way to understand what is happening in this region...” (1993, p. 37). And 
with the exception of Czechoslovakia, Bunce insists, the Eastern European 
transitions are not like Southern Europe and Latin America (1993, p. 43). Like- 
wise, the cases of the Dominican Republic and Ecuador are very distinct from 
those mentioned above, yet their transitions (1978 and 1979, respectively) 
have been grouped into the transition perspective (Conaghan & Espinal 1990). 

As the voices of dissent suggest, the Third Wave does indeed have a staggering
amplitude: It includes countries with a 50-year history of nondemocratic
rule as well as countries that had substantial democratic interludes; it includes
those terminating traditional authoritarian regimes, populist dictatorships
as well as bureaucratic authoritarian ones; and it includes countries that
dismantled their nondemocratic regimes with dramatic feats such as the de- 
struction of a wall, the assassination of former leaders, or the uprising of the 
military against the authoritarian regime, as well as those in which the military 
itself negotiated the transition. 

This celebratory literature also permits a wide amplitude in the definition of 
democracy. The debate over the essence of democracy has in no way been re- 
solved in the wave literature. Advocates of popular democracies argue that not 
only must there be widespread participation of majorities in decision-making, 
but the democratic process must be used for achieving social and economic 
justice. Advocates of polyarchy, in contrast, accept a minimalist version in 
which a small group rules and mass participation in decision-making is con- 
fined to restricted choices in periodic elections (Robinson 1996, p. 49). While 
the former stresses outcomes, the latter stresses processes. Likewise, the debate
regarding the measurement of democracy has not been resolved in the
wave literature. Should we use a binary measure (presence or absence of some 
institution associated with democracy) as early scholars did, or should we use a 
scale that includes numerous measures of liberty (Bollen & Jackman 1989, p.
612)? Despite legitimate concerns regarding the classification of nations, I
have passed over the definitional debate in order to highlight what authors 
have written about the causes of those transitions. 

Przeworski traces two major strategies in the research on redemocratiza- 
tion: 1. macro-oriented comparative works that focus on objective conditions 
and speak the language of causal determination; and 2. “micro-oriented studies 
which tend to emphasize the strategic behavior of political actors embedded in 
concrete historical situations" (1986, p. 47). The first implies that regime 
transformations are determined by economic, social, or political conditions. 
But he asks, “Were all the intentional, self-reflective, strategic actors merely 
unwitting agents of historical necessity?" (1986, p. 48). Przeworski's resolution
is to label the first as constraints that do not determine the outcome and
then to ask within those constraints, “How do alternatives become organized?" 
(1986, p. 53). Many may take this second question as a charge for studying individual
choice or social movements in isolation from the macroconstraints.
The emergence of a global wave, however, necessitates that we analytically 
embed the second question within the first. 

The principal intellectual challenge is to link global processes with domestic
ones and then to show how those domestic processes influence the daily experiences
of both those who rule and those who are ruled. The question "How
can global change constitute a catalyst for the transition-to-democracy?" can 
be rewritten as several questions: 1. In what way do global shifts affect domes- 
tic economic and/or political processes? 2. How do these domestic changes 
lead elites to withdraw legitimacy from the nondemocratic regime? 3. How do
these domestic changes encourage those who are dominated to mobilize
against the regime? and 4. When the walls come tumbling down, must democ- 
racy rise from the rubble? These questions are depicted as alternative causal 
paths to democratization (Figure 1), and each path is given a letter that will be 
used in summarizing the approaches of individual authors. 

Who addresses these questions? The judgment of Kincaid and Portes re- 
garding development studies is pertinent: 


To cover the vast territory [of]...the sociology of development counts with 
only a relative handful of specialists. The traditional parochialism of Ameri- 
can sociology is faithfully reflected in the fact that events affecting the ma- 
jority of the world's population are usually either ignored or reduced to a few 
variables in quantitative cross-national studies. (Kincaid & Portes 1994, p. 3)


In the period following the exhaustion of the modernization perspective, 
area studies remained the domain of a few sociological outliers. Those who 
had been working on totalitarian states (Goldfarb 1992), authoritarian regimes 
(Linz & Stepan 1978), underdevelopment, or dependent development (Evans
1979) were best positioned and equipped to analyze the new phenomenon of
(re)democratization. Those early transition scholars were followed by others 
who introduced a comparative perspective that was often cross-decade and 
cross-continental (for example, Seidman 1994 and Zubeck & Gentleman 
1994). In contrast to the bibliographies of standard research monographs, 
which tend to implode in the matter of citations, the literature cited in this re- 
view essay draws from a wide range of scholarship. 


THYMOS AND MORAL ROT 


Why did the authoritarian and totalitarian states wither away? Fukuyama's an- 
swer is poetic—the human need for expression (thymos) leads to the perfect 
political form (democracy). Having achieved global democratic acceptance, 
he tells us, we have reached the “end of history." Chirot asserts that the eco- 
nomic failures of state socialism and moral rot of the whole system rendered 
the system morally unbearable for the citizenry. While undeniably cogent, 
many such ex post facto accounts are silent on the issue of threshold. Why did 
the thymos quotient reach its tipping point so many centuries after the first 
Greek model? How did societies live so long with moral rot? In the final analy- 
sis, ex post facto models such as these often invoke "loss of legitimacy" as 
their main explanatory factor. The loss of legitimacy is certainly the quintessential
process (Linz & Stepan 1978). It is so quintessential, however, that it
borders on the tautological: The regime fell, thus it must have lost its legiti- 
macy. Rather than upgrade lost legitimacy to the principal causal mechanism, 
we should incorporate it as the critical intervening process and look to models 
that specify: 1. who specifically withdraws legitimacy, and 2. what accounts 
for their withdrawal. The transition scholarship includes excellent idiographic 
and ethnographic studies that cannot all be mentioned here. For this essay, I 
have drawn from those monographs that grapple with the notion of a global 
catalyst. By piecing together numerous works that touch on some aspect of the 
connection between globalization and redemocratization, we can assemble a 
more comprehensive answer. 


THE GLOBAL CONNECTIONS 


Authors who utilize a global linkage in their analysis offer prima facie evi- 
dence that the Third Wave transitions are not isolated, coincidental, or ran- 
domly distributed in time or space. Despite their shared conviction regarding 
the global or world-system nature of the democratic transitions, they diverge 
significantly in their designation of a global mechanism. I have sorted the 
global factors into six categories.


Favorable International Climate


Democratization in one country becomes the favorable climate, and thereby a 
partial cause, of democratization in others. Huntington argues that a favorable 
global atmosphere operates through diffusion, a demonstration effect, or 
snowballing. When knowledge of political events is transmitted around the 
world, it may trigger comparable events elsewhere (1991, p. 33). Success in 
some parts of the world may encourage other countries to see democratization 
as a solution for their problems. Huntington thinks that greater snowballing in 
the later phase of the Third Wave is due to the expansion of global communica- 
tions and transportation, particularly satellites, computers, and faxes (1991, p. 
101) (Figure 1, path C). 

Numerous authors invoke the effect of a favorable international climate: In 
his analysis of the Greek transition, Diamandouros describes how the climate 
was favorable to democracy: "In Europe...the disrepute into which authoritarian
rule had fallen since the days of Fascist experiments; the widespread acceptance
of the legitimacy of democratic rule and of populist politics; the multiplicity
of international organizations committed...to the preservation and defense
of democratic politics..." (1986, p. 146). The snowballing concept also
has been employed in explaining the fate of the Eastern bloc. Holmes argues 
that Gorbachev's criticisms of socialism and his own proposed reforms
“helped to undermine the legitimacy of communists everywhere" (1997, p.
26). Because of a favorable environment (the conviction that the USSR
would not intervene), Eastern Europeans pursued their own goals (Figure 1,
path C).

At a very general level, the favorable-climate perspective helps us understand
why, at the moment of regime collapse, democracy possesses more legitimacy
than monarchy, reformed totalitarianism, or some other alternative. It
leaves unspecified, nevertheless, a number of crucial dynamics. What ac- 
counts for the shift of the early trendsetters? Why didn't the model diffuse 
sooner after the Greeks developed it? Why do some nations cast off their 
democratic form? The favorable-climate answer presumes a model of ac- 
tion—the persuasiveness of an idea—in which some of the main puzzles are 
simply assumed away. By failing to take seriously the social, economic, and 
political processes at both the global and domestic levels, the model does not 
contribute much to our understanding of these tremendous transformations. 


Global Industrialization and Development


A second perspective in the literature contends that global economic development
promotes global democratization. Global growth has eliminated precapitalist
niches and resulted in a shift from capital intensive to technology intensive
production. This metamorphosis is sometimes referred to as a “convergence”
because it appears as though the preexisting distinction between industrialized
and nonindustrialized countries is disappearing. The globalization of
production unleashes mechanisms that lay the groundwork for democracy in at
least four ways. Some scholars stress the technological innovations in communication
and transportation that accompany global capitalism. For other scholars,
industrialization brings with it the growth of professionals and a middle
class, the main carriers of democracy. Still others see industrialization as
bringing with it the growth of the working classes, the main agents of democratization.
A fourth approach claims that global growth undermines interventionist
(nondemocratic) states. Global industrialization lessens the previous
gap between industrialized and nonindustrialized countries: Their economic 
profiles begin to converge. 

A first version of the global industrialization argument privileges the role of 
technology and communications. Technological innovations in communica- 
tion, it is argued, make it more difficult to withhold information from the 
masses and under these circumstances, democracy flourishes. 

“Just as the growing technology of control helped to shape authoritarian re- 
gimes in Latin America, new information technology is shaping Latin Ameri- 
can democracies, especially with regard to elections and public debate in the 
media" (Chalmers 1990, p. 2) (Figure 1, path E,G4). 

Markoff offers a plethora of factors, including communication and trans- 
portation. He attributes the world-wide subscription to democracy to the rise of 
communication and transportation, which increases the capacity of ordinary 
people to develop and sustain social movements (1996, p. 44). He suggests that 
the rise in communication means that both the governing elites and social 
movements pay attention to what other social movements and governing elites 
are doing elsewhere (1996, p. 20). Once the wave begins, Markoff argues, 
models (of sit-ins, underground cells, symbols, etc) and ideas (of social injus- 
tice) can spread throughout the networks of communication and transportation 
(Figure 1, path D,G3 and E,G4). Verdery, in a similar way, claims that technol- 
ogy and communications undermined socialism and contributed to democrati- 
zation. Solidarity's strikes in 1980 were "rebroadcast instantly into Poland via 
Radio Free Europe and the BBC mobilizing millions of Poles against their 
Party" (1993, p. 19) (Figure 1, path E,G4). 

Drawing on a large body of historical literature, Crenshaw ventures an em- 
pirical evaluation of the preindustrial legacy and its impact on democracy. He 
argues that advanced systems of transportation and communication endowed 
some early agrarian states with higher levels of spatial articulation, which en- 
couraged the adoption of political democracy (1995, p. 715). A rise in his agri- 
cultural density index was associated with a rise in the democracy index. 
While there is a wide intellectual moat between his theoretical exposition and 
the twelve regression models based on data from 1960 to 1990, the endeavor 
reflects an enormously creative leap beyond the routinely cited but regularly
untested effects of communication and transportation. 

Overall, the technology models are appealing, but they ignore the quandary 
that social and communication networks are content-neutral; fascist and
antidemocratic ideas can spread equally well. Such models do not tell us why
waves begin or why they are democratic rather than authoritarian. These
models have to assume the first perspective, namely that the international climate
values democracy rather than some alternative. Such models, Therborn as- 
serts, do not show much interest in the social dynamics (1977, p. 7). 

A second rendition of global growth can be found among Huntington’s ex- 
planatory factors. “[E]conomic development appears to have promoted 
changes in social structure and values that, in turn, encouraged democratiza- 
tion” (1991, p. 65). He suggests that global economic integration and industrial 
development provide greater resources for distribution and compromise, cre- 
ate nongovernmental sources of wealth and influence, and open societies “to 
the impact of the democratic ideas prevailing in the industrialized world" 
(1991, p. 66). It is akin to the classic Lipset notion (1960): Increased industri- 
alization brings with it the development of the middle class, which serves as a 
buffer between the wealthy and the impoverished. And, like Lipset, Huntington
posits that development increases the literate and educated population who
“develop characteristics of trust, satisfaction, and competence that go with
democracy” (1991, p. 66). These social strata (as individuals, not as a
collectivity) are the carriers of democratic values and the agents in the democratization
process. Rueschemeyer et al in their review of the redemocratization literature 
on Latin America also judge that “the middle classes played a more prominent 
role in the process of democratization than they had in advanced capitalist so- 
cieties” (1992, p. 222) (Figure 1, path A,G2). 

A third inescapable social transformation that accompanies global capital- 
ism is the growth of the working class. The legal emancipation of labor, the 
creation of free labor markets, and the concentration of industrial workers cre- 
ate conditions that foster worker strength and invite popular struggles. In cases 
where the working class needs and/or has allies, the process unfolds as democ- 
ratization rather than revolution or socialist transformation (Therborn 1977, p. 
34). An excellent synthesis of these arguments can be found in the work of
Rueschemeyer et al (1992). The works highlighted below are a sample of those 
that explicitly link the prodemocratic militancy of working classes to a global 
phenomenon. 

Maravall and Santamaria describe how global convergence generated new 
working classes in Spain and contributed to the breakdown of the Franco re- 
gime. Despite the fact that they begin with the obligatory “exhaustion of the 
economic model and the fragility of the grounds on which the regime was 
based” (1986, p. 74), their analysis hinges upon a global factor. Spain
embarked on a development project that invited foreign capital and linked Spain
to international markets. The economic project was accompanied by social 
transformations such as a rural-to-urban migration and an increase in the in- 
dustrial population. These social changes reinforced the organization of the 
democratic opposition and made it difficult to maintain the conservative pat- 
terns that had characterized the Franco regime (1986, p. 75). Their analysis 
points to the working class as the agent of a regime transformation. Whereas 
the second approach offered individual attitudes of trust and civility as the 
democratic building blocks, this third approach requires the collective action 
of workers. And, rather than relying on a favorable climate for democracy, it 
provides an endogenous understanding of why democracy is the preferred re- 
gime replacement. Revolution and democracy offer the best opportunities for 
workers to satisfy their material needs, and excluding the possibility of a so- 
cialist revolution, democracy is the preferred successor to Franco's regime 
(Figure 1, path A,C,G4). But as Therborn argues, "[T]he working-class move- 
ment was nowhere capable of achieving democracy by its own unaided re- 
sources..." (1977, p. 34). This suggests then that our analytical framework 
need go beyond simply pinpointing the strength of one class or another as the 
carrier of democracy. 

A fourth variant of the global growth model is the most nuanced in terms of 
the interplay between globalized capitalism and class-state relations. As pro- 
duction becomes more electronically based and capital moves quickly around 
the world without stopping for long periods in production processes, it limits 
the role of national governments. As globalizing capital hollows out the state, 
new conflicts emerge between the (nondemocratic) state and fractions of the 
capitalist class. Huntington's minimalist version is that "broad-based eco- 
nomic development involving significant industrialization may contribute to 
democratization...” (1991, p. 65) because industrialization leads to a complex, 
diverse, and interrelated economy that is more difficult for authoritarian
regimes to control.

What is named "industrial convergence" at the global level is tantamount to 
“late industrialization" for individual nations. With the globalization of pro- 
duction, some countries that were previously categorized as peripheral or 
semiperipheral (because of their integration into the world-system as raw ma- 
terial exporters) have graduated from that rank. Many countries like Brazil, 
Mexico, and South Africa grew fastest during the golden years of global ex- 
pansion. The fourth interpretation pinpoints how global convergence alters the 
relationship that the state has with fractions of capital on one hand and with the 
working classes on the other. 

Seidman's work on South Africa and Brazil exemplifies this analytical per- 
spective. She asks how the elite, who had cooperated with the previous 
authoritarian state and depended upon it for protection and support, came to 
challenge that state and to acknowledge the legitimacy of workers’ demands;
and how “militant strikes and organization...spread from factories to commu- 
nities...to encompass broad demands for social inclusion and citizenship” 
(1994, p. 97). The authoritarian state had ushered in structural changes typi- 
cally associated with late industrialization; specifically, it attracted foreign and 
domestic capital into heavy industry (1994, p. 11). Such rapid state-sponsored 
industrialization shifted the composition of the business elite, putting industry 
in a dominant position over mining. This process was interrupted by the post- 
1973 international recession, which precipitated conflict between elites and 
the state over how to sustain the new industrial growth. Entrepreneurs had the 
perception that they were being closed out of policy-making agendas. Parasta- 
tals in both South Africa and Brazil were criticized for unfairly competing with 
private companies and for spending revenue on relatively unprofitable state- 
owned energy and arms industries (1994, p. 98). Seidman’s state-capitalist 
class analysis provides the grounds for her observation that fractions of the 
capitalist class were withdrawing their support from the nondemocratic state. 

Turning to the working class, Seidman asks if there is something about late 
industrializing countries that spurs workers to apply their militant discourse on 
class to that of citizenship. Rapid industrialization reshaped the industrial 
working class while denying workers and their families access to political and 
labor organizations. Seidman suggests that, by demanding greater access to state
decision-making bodies, the industrialists created a political space in which la- 
bor movements could begin to demand the right to organize factory-based un- 
ions. For her, the timing of the new unionism boosted its chances of survival 
because dominant groups were already engaged in debates about democratiza- 
tion (1994, p. 10) (Figure 1, path A,B,G3 and C,G3, and F). 

These four approaches all posit an affinity between global convergence and
democratization, yet history cautions against easy generalizations. The early 
insight of Lipset (1960) regarding the affinity between development and de- 
mocracy was turned on its head by a subsequent reverse wave of bureaucratic 
authoritarianism. What Moon writes about South Korea has been written by 
many: "[d]emocracy and globalization have not been necessarily complemen- 
tary. They have often produced ambivalent and conflicting implications” 
(1996, p. 10). While globalization opens up economic spaces for the private 
sector, democratization opens up civil society to public interest and progres- 
sive social groups. In short, societies still confront the hostility inherent in 
democratic capitalism—those who own the means of production do not have a 
monopoly of power, and those who have political power are without owner- 
ship of the means of production (Marx 1964). 

These authors formally insert class conflict into their interpretations. We 
learn from the second and third approaches that 1. global convergence has al- 
tered the social structure in a way that upsets the previous regime, and 2. these 
newly created groups, middle classes and workers, prefer democracy. The
fourth interpretation traces how convergence reduces the functions of the state 
and alters the relationship between the state and numerous social classes. This 
global convergence literature adds social processes that were taken for granted 
in the "favorable environment" perspective. 


Global Shocks 


Rather than identifying a secular trend such as global industrialization, another 
literature identifies global shocks that have had world-wide political rever- 
berations. In a minimalist version, any shock may create a legitimacy crisis 
that alters the political regime. This is essentially Hobsbawm's argument re- 
garding the 1929 world market collapse and the subsequent fall of ten oligar- 
chic regimes in Latin America (1967, p. 46). Most prominent in the rede- 
mocratization literature is, of course, the oil crisis. After the 1973 oil crisis, 
countries “such as the Philippines, Spain, Portugal, Greece, Brazil, and Uru- 
guay were particularly hard hit" (Huntington 1991, p. 51), and it is in this 
population of countries that we see the movements toward redemocratization. 
The second oil crisis in 1979 not only fueled the Third Wave, but in "West Ger- 
many, France, Canada, and the United States, incumbent parties were turned 
out of office" (Huntington 1991, p. 51). 

Martins (1986) traces the following path from the global oil shocks to
democratization. The first oil shock produced a global recession. The Brazilian
government, he argues, inattentive to this, continued with its original develop- 
ment program. Instead of delivering growth, that strategy created a domestic 
economic crisis exhibited in skyrocketing domestic interest and inflation rates, 
a drop in industrial production, a rise in the national debt, and an inability to 
service the debt (1986, p. 90). Such precarious conditions widened the gap be- 
tween state promises and state accomplishments. This, and the need to renego- 
tiate the debt with the IMF, "deprived the regime of one of its most efficient 
means of acquiring support from strategic groups" (1986, p. 91) and eventually 
cost the regime its legitimacy. Capitalists joined with the already existing 
mass-based opposition groups, undermining the nondemocratic regime. 

The literature that enters the account with the third world debt crises of the 
1980s also falls into this global shock camp. The debt crisis is the sequel to the
oil crisis because windfall petro dollars were reloaned to third world nations, 
which had acquired substantial trade deficits owing to the increased cost of
imported oil (Frieden 1991, p. 63). Castells and Laserna link democratic
liberalization in Mexico back to the 1981–1982 debt crisis and the economic
restructuring that it necessitated. They trace a process that began with a core nation: 1.
"dominant economic aspects of the world economy (the United States) were 
willing to incorporate Mexico in the dynamic core of that economy"; and 2.
Mexico had the political capacity to pull it off, namely, “the efficient
clientelistic system embracing the entire set of relationships between state and
society” (1994, p. 68). That economic restructuring increased Mexico's role as an
export processing platform, increased Mexico's ability to compete globally 
via technological modernization, and increased international inter-govern- 
mental and inter-firm cooperation. The restructuring and integration led busi- 
ness interests to realize they did not need the state as an intermediary with the 
United States. Mexico "no longer needed a central state" (1994, p. 73). These 
newly autonomous business interests generated historic electoral victories for 
the conservative party (PAN). Reduced state control and state ownership 
harmed a bureaucratic middle class whose control of the state had been its 
source of power and prestige. These newly disaffected middle classes pro- 
vided the basis of electoral success for the Cardenas party (PRD). And finally, 
restructuring, debt negotiations, and the austerity programs secured Mexico's 
credibility in the international finance community but led to “popular
discontent over the deterioration of living conditions” (1994, p. 73). This too
contributed to the electoral success of the leftist party of Cardenas (PRD). Castells and
Laserna have constructed an account that goes beyond the global convergence 
models and offers us insight into the timing and processes of democratization 
(Figure 1, path A,D,B,G3 and C,G4). 


Foreign Intervention 


The contemporary foreign intervention scholarship has highlighted the posi- 
tive role of the United States in promoting democracy. In 1991, the US Con- 
gress tied guidelines regarding democracy and human rights to foreign aid (Jo- 
seph Gichuhi Njoroge, 1996. "Linkage politics: foreign aid and political 
changes in Sub-saharan Africa,” MS: p. 3). And in 1991 and 1992 the United 
States and the Paris Club declared a moratorium on aid to Kenya and to Ma- 
lawi pending the implementation of political reforms such as multiparty de- 
mocracy, cessation of politically motivated torture and imprisonment, and the 
like. The analytical framework that attributes the global ascent of the
democratic form to foreign intervention faces a colossal challenge. Foreign
intervention does not uniformly bring democracy; sometimes it brings dictatorship.
The foreign intervention factor includes both long-term residence and 
short-term intervention. Regarding the former, many have hypothesized that 
former British colonies, on the average, had a higher probability of being 
democratic (Bollen & Jackman 1985, Crenshaw 1995). Rueschemeyer et al 
point out that limited suffrage even preceded independence in the British set- 
tler colonies—the United States, Canada, Australia, and New Zealand. British 
indirect rule had this beneficial outcome because it transferred representative 
institutions. In contrast, where the British exercised direct rule, it precluded 
the emergence of a civil society that might have nurtured democracy at a later 
time (1992, p. 121). In Arab lands, the British armies "played a major role in
organizing the new...profoundly undemocratic monarchies" (Markoff 1996, 
p. 75). While the legacy of British indirect rule might explain the surge of
democratization in the post-World War II decolonization period, it gives us little
purchase on the recent Third Wave. 

The balance of the literature assesses how the foreign policy objectives of 
some countries determine the political regimes of others. "How important are 
the international factors influencing attempts at redemocratization? What mo- 
tivates governments to proclaim the ‘promotion of democracy’ as an important 
goal of foreign policy...?” (Whitehead 1986, p. 3). Roquie argues that the 
hemispheric policy of successive US administrations—the alternation after 
1945 between anti-Communist vigilance and democratizing preoccupa- 
tions—imparts a rhythm to the phases of autocracy and the waves of demilita- 
rization..." (1986, p. 126). This foreign factor is imposed on nations through 
the use of imitation and intimidation. The overthrow of Argentine President 
Frondizi in 1962, for example, was a response to national conflicts that bor- 
rowed heavily from the “defensive perspective outlined by the Pentagon in the 
framework of post-Cuban-Revolution strategic objectives" (1986, p. 126) 
(Figure 1, path G1). During the post-war period, the Common Market
(allegedly) excluded Spain, Greece, and Portugal because of their political regime.¹
While Europe created a “long-term pressure for democratization” (Whitehead 
1986, p. 23), the United States relegated the “promotion of democracy” to a 
lower priority (Whitehead 1986, p. 40). “The greater America’s security con- 
cern, the stronger was its benevolence toward the authoritarian Right" (1986, 
p. 42). From the time of Kennedy (who said he would not be averse to the over- 
throw of the elected Brazilian government) through Reagan, the US govern- 
ment did not behave in ways consistent with its principled advocacy of democ- 
racy (Whitehead 1986, p. 7) (Figure 1, path G1). 

What then accounts for this shift? The end of the Cold War, a new historical 
conjuncture, offered the ruling hegemon(s) the luxury to tolerate and even
support democracies. Huntington says that as early as 1977, the International
League for Human Rights indicated that human rights had become a national 
policy item (1991, p. 94). Carter started the conversation on human rights that 
Reagan institutionalized in the National Endowment for Democracy (founded 
in 1984). Yet the support for democracy seemed tenuous in the early years. 
Steinmetz (1994) asserts that the United States was only a weak champion of 
democracy to the Shah (Iran), Somozas (Nicaragua), and Marcos (Philip- 
pines). By the 1990s, however, the historical conjuncture of supporting dicta- 
tors appeared to be coming to an end. 


¹Poulantzas argues instead that it was fear of redundancy in agricultural production
Such explanations for the rise of democracy are deduced from an analysis
of the foreign policy needs of hegemonic powers. This literature clearly estab- 
lishes that decisions to construct particular political regimes are not always en- 
dogenous, rather they receive a heavy contribution from exogenous forces. 
What is still puzzling is why core nations found the promotion of democracy in
the developing world better suited to global capitalism. From where comes this
good will?


Shifting Global Hegemon 


The scholarship on hegemons does not stand totally apart from the
above-mentioned works; to the contrary, it builds upon them. In this section, I refer to authors
who explicitly conceptualize the global impact in terms of a shifting global he- 
gemon. Wallerstein (1991), in his analysis of the collapse of the Eastern bloc, 
offers such a model. The collapse of Eastern Europe is not the triumph of 
Western liberalism; it is the aftermath of US hegemonic decline. Yalta (and the 
cold war) allowed the Soviet Union and the United States to keep order in their 
respective houses. For the USSR, it meant a monopoly over the Communist 
discourse, permitting it to direct or repress revolutionary socialist tendencies 
in Eastern Europe and the Third World. When faced with German and Japanese
economic competition, US hegemony came undone. As the US hegemony
collapsed, so too did the Yalta-approved hold that the Soviets had on their
own citizens and those of the Eastern Bloc (Figure 1, path G1).

In his study of the 1970s' transitions, Poulantzas argues that the shift in
global hegemon totally undermined the authoritarian regimes. “The funda- 
mental question regarding the overthrow of the dictatorships in Portugal and 
Greece, and ...Spain, is...in what way have the so-called ‘external’ factors, the
changes involved in the present phase of imperialism, been reproduced and in- 
ternalized actually within the socio-economic and political structures of these 
countries?” (1976, p. 41). Let me summarize his analytical scheme for the
Portuguese case.

Portugal, even in the post-World War II period, still derived its wealth from 
exploiting its colonies. Its international partners also benefitted from the colo- 
nial wealth. The changing world-system, however, shattered this pattern. As 
labor costs rose in the developed world, capitalists began to export capital to 
the less developed world (1976, p. 12) where a more intensive exploitation of 
labor was possible. In the early phase, industrial investments in developing 
countries were characterized by low levels of technology, expatriated profits, 
and concentration in manufactured products. Foreign capital entered Portugal 
in 1960 and boosted the growth in the industrial sector. While the GNP grew
1.5% and 5.9% in the agriculture and service sectors respectively, it grew 9.1%
in the industrial sector. The second dimension of his analysis involves the shift 
from American to European dominance. By 1972, West German capital
investments in Portugal had overtaken American capital. British capital was
right behind (1976, p. 25). In Greece, Spain, and Portugal, the percentage of
trade with the European Union was growing faster than trade with the United 
States. 

The Portuguese project of colonial exploitation had a parallel political for- 
mation comprised of a numerically and politically weak working class; a petty 
bourgeoisie tied to the large state apparatus; and an oligarchy that included 
land owners who functioned chiefly as commercial and financial intermediar- 
ies for foreign capital associated with colonial exploitation. The new wave of 
foreign investment opened a space for domestic industries and gave birth to a 
new bourgeoisie (1976, p. 42). This new economic project differed substan- 
tially from the earlier one: It needed protected markets and state support to be 
competitive internationally. Nonetheless, the Estado Novo (the Portuguese 
authoritarian state) continued to allocate resources to the former project,
dedicating, for example, 50% of the state budget to the colonial wars.

The new fractions unsuccessfully sought to renegotiate the compromise 
that the state had made with the older comprador bourgeoisie and to acquire a 
political weight equal to their new economic weight (to break the dispropor- 
tionate grip of agrarian interests). This struggle led to deep conflicts within the 
power bloc which could not be resolved. Herein lies, for Poulantzas, the an- 
swer to the question: Why democracy? To resolve these conflicts without 
bloodshed, the state needed to be organized in a fashion that would permit on- 
going negotiation and resolution, and it needed to allow the various classes to 
be represented by their own political organizations (Poulantzas 1976, p. 48). 


[T]hese military dictatorships did not enable such contradictions to be regu- 
lated by the organic representation of these various fractions within the state 
apparatus, nor did it allow for the establishment of a compromise equilibrium 
without serious upsets....We can add here that the fall or decline of these re- 
gimes corresponded to a redistribution of the balance of forces within the 
power bloc in favour of the fraction of capital polarized towards the Com- 
mon Market and at the expense of the fraction polarized towards the United 
States, whose interests these regimes preponderantly represented, though not 
exclusively. (1976, p. 30) 


But what are we to make of the role of popular struggles in democratiza- 
tion? The authoritarian regimes never were hegemonic among masses. Oppo- 
sition existed in Portugal since the 1926 overthrow of its democracy (Schwartz- 
man 1989), yet "(t)here was no frontal mass movement against the dictator- 
ships, and in this sense, the popular struggles were not the direct or principal 
factor in their overthrow" (1976, p. 78). Nevertheless these struggles were ob- 
viously a determining factor. At this point, we can invoke those processes 
summarized above: The global hegemonic shift that gave rise to the new capi- 
talist class also fostered urban migration and proletarianization of a section of 
the peasantry. As we have seen, this often increases the volume of class
conflict. Since economic protest, such as striking, was illegal under the
dictatorship, class struggles became political struggles. Rather than reinforcing
authoritarian relations, such transformations gave birth to a new
pro-democracy coalition of new domestic fractions of capital and workers. In this
aspect, Poulantzas’ analysis moves beyond observing the emergence of a new
working class. He argues that workers and mass organizations will be
tolerated by the new capitalists for two reasons. First, because of the nature of their
economic activity (they produce wage goods rather than exported goods), they 
can tolerate a more open and conciliatory position toward trade unions (1976, 
p. 56). Second, they need the mobilized masses for their own struggle against 
the agrarian bloc. After failing to end the dictatorship without the associated 
risks of mass mobilization, the new fractions of capital reluctantly accepted 
mass participation (1976, p. 56). Thus, the transition to democracy resulted 
from


...a conjunctural and tactical convergence of interests between the domestic
bourgeoisie on the one hand and the working classes and popular masses on
the other, its objective being the replacement of these regimes by “demo- 
cratic" ones. (1976, p. 58) (Figure 1, path A,B,G3 and C,G4 and F) 


In a grammatically less obstructed version of Poulantzas’ argument, Logan 
locates the principal conflict in Spain and Portugal between the competing fac- 
tions of capital tied to irreconcilable strategies of integrating into the world- 
system (1985, p. 149). Such a line of investigation might have application for 
the Eastern European cases. Noting the shifting hegemon, “Gorbachev
frequently spoke of the USSR's European home” (Bergesen 1992, p. 140).

In Arrighi’s interpretation of the new hegemonic environment, the shift 
from global hegemonic anarchy to a clear bipolar order had an impact on 
semiperipheral nations. In the postdepression and World War II period, direct 
investment came to supersede trade and territorial expansion as the leading ve- 
hicle of transnational competition: The UK flag of free trade was replaced by 
the US flag of free enterprise. Arrighi argues that the sustained process of capi- 
tal investment, particularly under conditions of global competition (the 
post-World War II split between capitalism and communism), led to the even- 
tual proletarianization of workers, even in rural areas (1985, p. 272). This re- 
sulted in a “resurgence of labor movements in forms that fascism could not 
contain.... [which] pushed political elites towards democratic socialist forms 
of political-economic regulation" (1985, p. 264). For Arrighi, the hegemonic 
shift unleashes its influence mainly through labor (1985, p. 273) (Figure 1,
path A,C,G4 and F,G3). 

In these works, we are supplied a stronger, more explicit relationship
between the economic development of core nations and the political
transformations in semiperipheral ones. And, at least for Poulantzas, there is an attempt to
identify the symbiotic tie between elites and masses as they jointly mobilize 
for democracy. 


World-System Cycles 


World-system cycle models, rather than concluding with something episodic 
or accidental at the global level, invoke the systemic and cyclical nature of the 
world-system. The world-system develops through cycles of accumulation, 
which are composed of phases of capital formation, consolidation, and disinte- 
gration (Arrighi 1994, p. 10). The periods of stagnation, referred to as the B- 
phase, are accompanied by numerous phenomena such as intensified intercapi- 
talist competition (1994, p. 88). B-phases are certainly associated with a reor- 
ganization of production, and they may or may not be associated with a shift in 
hegemonic dominance. Even when the hegemonic leader retains global
leadership, the character of that hegemon will change. Thus, the United States may
retain its hegemonic control, but it must capitalize on new technologies and or- 
ganizational forms to do so. In some cycles of decline, hegemonic leaders are 
unable to maintain their monopoly and fall behind new countries that are in a 
better position to exploit new economic opportunities. Bergesen describes, for 
example, how the hegemonic shift from England to the United States at the end 
of the nineteenth century was tied to a shift from the British organization of 
economic production in family firms to the US organization in corporations 
(1983). Poulantzas also placed his analysis on top of a hegemonic shift from 
the United States to Western Europe. While world-system cycle analysts are 
divided about whether a hegemonic shift is occurring, they do agree that the 
exit from the current cycle of stagnation will require new forms of economic 
organization. Furthermore, they link the contemporary Third Wave of democ- 
ratization to the current economic contraction. This assertion diverges radi- 
cally from that of Huntington and others who argue that “[T]he wave of de- 
mocratizations that began in 1974 was the product of the economic growth of 
the previous two decades" (1991, p. 61). 

While the shifts to democracy have taken place around the globe, they are 
not found at all levels of the global hierarchy. Democratization in the Third 
Wave has found more fertile grounds in the semiperiphery. Why democracy in 
the semiperiphery? Semiperipheral countries are hardest hit by the shock 
waves of the B-Phase. At the same time, periods of instability and intracore 
competition give nations in the semiperiphery opportunities to increase their 
relative power. Bergesen argues that democracy is a preferable national
strategy because it is enabling. “The pressure of the downturn generates social and
political crises, and...semi-peripheral states turn on themselves and
reconstitute internally in the hope of better dealing with the crisis of slow growth,
inflation, unemployment, and staggering debt payments” (1992, p. 144).
But why should instability in the semiperiphery lead to democracy rather
than fascism or bureaucratic authoritarianism? We know that one of the do- 
mestic manifestations of a semiperipheral position can be a disarticulated 
economy, which in turn has detrimental effects on democratic stability 
(Schwartzman 1989). We also have an extensive scholarship that illustrates 
the necessity and efficacy of "developmental dictatorships" in the semiperiph- 
ery [Gerschenkron (1966) on late developers in Western Europe, Gregor 
(1979) on Mussolini, O'Donnell on Latin America, and Wallerstein (1979) on 
the world-system, to name a few]. According to these authors, aggressive non- 
democratic states historically have offered advantages over democratic ones. 
By restricting consumption and directing resources toward savings and invest- 
ment, nations such as South Korea, Chile, and Brazil used nondemocratic re- 
gimes to improve their standing in the world-system hierarchy. We need to ac- 
knowledge the empirical fact that semiperipheral nations, which historically 
counted less frequently among democratic nations, now seem to make up the 
bulk of the Third Wave. Furthermore, we have to concede that while there 
were many authors who showed how developmental dictatorships were the 
natural response to a world crisis, there are now many more who are complacent
to describe democracy as the natural response to a world shift. A world-system
analysis sheds some light on this dilemma. The question must be revised
to read: “Why were late developers more receptive to nondemocratic regimes
in the B-phase of the free trade cycle and to democratic regimes in the B-phase
of the global convergence cycle? Why is democracy now the way to negotiate
the new global linkages?” Or, the even stronger version, which synthesizes
aspects of core-semiperipheral relations and cyclical trends, “Why has
democratic rule replaced authoritarian rule as the superior handmaiden of foreign
capital penetration?”

One answer can be quickly deduced from the literatures that describe the re- 
lationship between the form of state and world-system location. Wallerstein 
argued that semiperipheral countries were more likely to have “intervention- 
ist" states because of the particular system of labor control found there. 
Authoritarian states were more successful in guaranteeing the semifeudal 
forms of labor control (tenancy or sharecropping) which were more prevalent 
in the semiperiphery (1984). Therefore, these semiperiphery nations, by virtue 
of global economic convergence, have lost their quality of “semiperipheralness”
and taken on some qualities of the core, such as an enlarged working
class and an industrial bourgeoisie that can contest the power of the landed aristocracy.
Growth of these new sectors reduced the level of economic “disarticulation,”
thereby removing the obstructions to democracy. Thus, as countries
develop and move into this zone, “they become prospects for democratization”
(Huntington 1991, p. 60). One shortcoming of this deduction is that it
reduces the world-system component to the “global convergence” argument
and is silent about the world-system shifts and their repercussions for non-core
countries. Furthermore, it doesn't speak to the dilemma of why the United 
States was an enthusiastic supporter of nondemocratic regimes some fifteen 
years earlier. 

We need a model that speaks directly to aspects of this particular downturn, 
how it affects core-semiperipheral economic relations, and how this, in turn, 
contributes to democratization. By incorporating the global crisis into her 
analysis of Poland, Verdery takes one step in this direction. Core nations had 
two reactions to the crisis in the early 1970s: First, they earned money by lend- 
ing abroad, and second, they shifted from Fordist production to flexible spe- 
cialty production. These factors constituted the environment for socialist 
economies, which were equally affected by the global crisis. Governments 
first tried to salvage socialism without transforming it. "Instead of reforming 
the system from within, most Party leadership opted to meet their problems 
by...importing western capital and using it to buy advanced technology (or, as 
in Poland, to subsidize consumption)” (1993, p. 14). The world market was un- 
able to absorb the increased volume of exported manufactured goods coming 
out of the socialist economies, and therefore, borrowers were unable to repay 
their debts. This debt crisis increased the power of that fraction of the ruling 
elite (within the Communist Party) that had advocated structural reforms, in- 
cluding markets and profits. Bureaucrats themselves created private compa- 
nies at the interstices of the socialist state and the capitalist economy: They 
mediated export trade, and they imported computers for the state. 

They also embarked on the new capitalist form of flexible specialization. 
Here, Verdery links the global cycle to democratization by contending that if 
the capitalist world still had been pursuing the Fordist conception of growth, it 
might have been more receptive to state organized “large-scale heroic produc- 
tion...” (1993, p. 16). Having moved to small-scale flexible specialization, 
however, core nations were less sympathetic to state directed growth (Figure 
1, path A,B,C,G3,G4). Komlosy and Hofbauer add that the core nations were 
also less sympathetic to helping the East catch up to the West. Core nations fol- 
lowed the old Cold War regulations forbidding the export of advanced technol- 
ogy to Eastern Europe (1994, p. 135). As we have already seen in the discus- 
sion of Mexico and Brazil, the core response to the economic crisis created a 
new capitalism that struck at the heart of centralized control, undermining a 
host of state functions. Reformers invited foreign capital into their domain, but 
they also invited in privatization and other conditions that accompany such
capital. In this way socialism was undermined by a government that had 
launched a venture to save it. 

Cardoso also builds aspects of this particular B-phase into his work on Bra- 
zil. He posits that the global contraction led core nations to adopt economic 
policies toward non-core nations that not only created conflict between the 
state and fractions of the capitalist class, but also undermined the regime's stability.

[T]he evolution of the international economic crisis and the pressure from
international partners to place their equipment in Brazil's industrial projects...
made it difficult for the Geisel government to keep its promise to sustain
the national capital goods sector. Worsening foreign debt, fueled by the
importation of foreign equipment...left the government even less room for
maneuver in maintaining the goals of autonomy... [and] the local capital
goods manufacturing sector” (1986, p. 144). “...It was against this background
that the private sector discovered ‘democracy’ and some industrialists
even rediscovered the constitution. (1986, p. 143)


In distancing themselves from the state, industrialists rediscovered democracy,
the “renewal of party-political activity and the emergence of pressure
groups and social classes” (1986, p. 139), and then they discovered that the
masses could be partners in this anti-government coalition. At the 1977 assembly
of the National Manufacturing Sector Congress, entrepreneurs defended
the workers’ right to strike and asked for unqualified democracy and qualified
economic liberalism (1986, p. 142). In 1978 they issued a public declaration
that read, “We wish to express our view of the path to economic development,
based on social justice and promoted by democratic political institutions...”
(1986, p. 145). For Cardoso, it was clearly the integration of the national
development program into a “crisis-ridden” “free-enterprise” global
economy that led the elite to withdraw legitimacy and demand a political opening.

The above B-phase answers link the nature of the current cycle of capitalist 
accumulation to core economic policies in less developed countries. In short, 
they give meaning to the notion of globalization of production. Having estab- 
lished this connection, we can draw from Seidman and others who demon- 
strate how this new globalization has transformed the working classes and led 
to new social movements, which the nondemocratic regimes found difficult to
contain. In addition, drawing on the work of Poulantzas and others mentioned 
above, we see why the new technocratic and globally integrated elite may not 
have required total repression of the class conflict that was accompanying 
globalization. But why democratization? Democratization can actually decrease
levels of mobilization and participation. By transforming a contentious
and organized working class into a citizenry, democracy can more peacefully
guide developing countries through the painful process of global integration.
Because democracy encourages individuals to participate on the basis of unlimited
collective identities (Catholics, Greens, Bavarians, etc), it fosters the
individualization of class relations, particularly at the level of politics and ideology
(Przeworski 1985, pp. 12-13). Synthesizing many of these linkages, Robinson
argues, “All over the world, the United States is now promoting its version
of ‘democracy’ as a way to relieve pressure from subordinate groups for
more fundamental political, social, and economic change” (1996, p. 6) (Figure
1, path G1,A,C,F,G3,G4). This B-phase world-system perspective seems to
offer the greatest insights in deciphering the deeper significance of the Third
Wave of democratization.


CONCLUSION 


The literature contains many informed works that focus in one way or another 
upon the global-civil society connection. What do we know? The majority (al- 
beit not unanimous) conclude that the transitions were not simply popular up- 
risings. Many conclude that fractions of the capitalist classes played a signifi- 
cant part in preferring and achieving an alternative state form—democracy. 
Others stress that domestic transformations are situated in a world-system in 
which core trajectories have inescapable ramifications for the economic and 
political regimes of non-core nations. Our greatest insights, it seems to me, 
come from those works that identify class conflict as the social mechanism 
linking world-system processes to national political dynamics. In this frame- 
work, domestic political structures become part of the evolving transnational 
fabric of economic relations. 

While the culturally distinct and event-specific accounts of transitions to 
democracy give us invaluable insight into the unfolding of the transitions, they
keep us from an understanding of the global networks in which nations have 
been embedded for ages. The historical clustering and its companion wave 
paradigm leave case-study researchers little option but to seek conversations 
with comparativists. Inversely, the macro accounts give us invaluable insight 
into deep structures but withhold from us understanding of individual actors 
and questions of agency. The cross-national variation and its companion 
“voice” literature leave macro researchers little choice but to seek conversations
with area-study researchers. “They desired freedom” is as deficient an
explanatory model as “the B-phase made them do it.”

Such is a tall order. Specialization means that most researchers are unpre- 
pared to operate both multiple regression and ethnographic vehicles. For meth- 
odological and epistemological reasons, no one researcher is likely simultane- 
ously to arrive at both "they desired freedom" and "the B-phase made them do 
it" conclusions. In fact these two perspectives rarely meet. Yet it is in that 
meeting that we have the most to gain. On the one hand, we have to avoid ex- 
cessive fine-tuning of the hermetically sealed literatures of social movements 
or world-systems which stress single factor dynamics. On the other hand, we 
have to beware the false unions that substitute eclectic aggregation for theo- 
retical synthesis. The excitement and frustrations of citizens in newly democ- 
ratized nations have their counterparts in sociology. It is momentous. 
ABSTRACT


The study of social dilemmas is the study of the tension between individual
and collective rationality. In a social dilemma, individually reasonable be- 
havior leads to a situation in which everyone is worse off. The first part of 
this review is a discussion of categories of social dilemmas and how they are
modeled. The key two-person social dilemmas (Prisoner's Dilemma, Assur- 
ance, Chicken) and multiple-person social dilemmas (public goods dilem- 
mas and commons dilemmas) are examined. The second part is an extended 
treatment of possible solutions for social dilemmas. These solutions are or- 
ganized into three broad categories based on whether the solutions assume 
egoistic actors and whether the structure of the situation can be changed: Mo- 
tivational solutions assume actors are not completely egoistic and so give 
some weight to the outcomes of their partners. Strategic solutions assume 
egoistic actors, and neither of these categories of solutions involve changing 
the fundamental structure of the situation. Solutions that do involve changing
the rules of the game are considered in the section on structural solutions.
I conclude the review with a discussion of current research and directions for 
future work. 


THE QUESTION OF COOPERATION 


Social dilemmas are situations in which individual rationality leads to collec- 
tive irrationality. That is, individually reasonable behavior leads to a situation 
in which everyone is worse off than they might have been otherwise. Many of 
the most challenging problems we face, from the interpersonal to the interna- 
tional, are at their core social dilemmas. 
As individuals we are each better off when we make use of a public resource,
such as public television, without making any contribution, but if everyone
acted on this conclusion, the public resource would not be provided and we
would all be hurt. Each farmer does best by taking as much irrigation water as
possible, and each fisher benefits from catching as many fish as possible, but 
the aggregate outcome of these individually reasonable decisions can be disas- 
ter—groundwater exhausted and fish species depleted to the point of extinc- 
tion. 

This review of the literature on social dilemmas is divided into two major
sections. The first is a discussion of categories of social dilemmas and how 
they are modeled. The second is an extended treatment of possible solutions 
for social dilemmas. I conclude with a discussion of current research and directions
for future work.

Any review of this length is necessarily selective. I have focused mainly but
not exclusively on research since 1980 and on behavioral studies that use ei- 
ther experimental methods or field research. Even with these filters, the work I 
discuss is still a small sample; my goal is to provide a structure for understand- 
ing this area of research and a set of pointers to useful resources. For further in- 
formation I would suggest other reviews by Komorita & Parks (1995, 1996), 
Ledyard (1995), Yamagishi (1995), van Lange et al (1992), Messick & Brewer 
(1983), Stroebe & Frey (1982), Orbell & Dawes (1981), Dawes (1980), Edney 
& Harper (1978a). A number of edited volumes are useful general resources, in
particular the volumes that have come out of the biannual International Conference
on Social Dilemmas: Liebrand & Messick (1996), Schulz et al (1994),
Liebrand et al (1992), Wilke et al (1986). Other useful edited volumes include 
Schroeder (1995) and Hinde & Groebel (1991). An important set of field studies
on social dilemmas can be found in Ostrom et al (1994), Bromley et al
(1992), Ostrom (1990), McCay & Acheson (1987), Hardin & Baden (1977). 
There is even a popular press account of these issues in Poundstone (1992). Fi- 
nally, a variety of resources are now available on the World Wide Web. I have 
collected a number of these sources at a Web page devoted to this review: 
www.sscnet.ucla.edu/soc/faculty/kollock/dilemmas (1998b).




MODELING SOCIAL DILEMMAS


All social dilemmas are marked by at least one deficient equilibrium. It is defi- 
cient in that there is at least one other outcome in which everyone is better off. 
It is an equilibrium in that no one has an incentive to change their behavior. 
Thus, at their worst, social dilemmas exemplify the true meaning of tragedy: 

“The essence of dramatic tragedy," wrote Whitehead, “is not unhappiness. It 
resides in the solemnity of the remorseless working of things” (quoted in
Stroebe & Frey 1982). A group of people facing a social dilemma may completely
understand the situation, may appreciate how each of their actions contributes
to a disastrous outcome, and still be unable to do anything about it.

The most severe social dilemmas are also characterized by a dominating 
strategy that leads to a deficient equilibrium. A dominating strategy is a strat- 
egy that yields the best outcome for an individual regardless of what anyone 
else does. The compelling, and perverse, feature of these dilemmas is that there 
is no ambiguity about what one should do to benefit oneself, yet all are hurt if 
all follow this “rational” decision. However, not all social dilemmas involve 
dominating strategies, as we see below. 


Necessary, Dangerous Metaphors 


The literature in social dilemmas has revolved around three metaphorical sto- 
ries that have assumed mythic proportions. These stories—the Prisoner’s Di- 
lemma, the problem of providing Public Goods, and the Tragedy of the Com- 
mons—have served as catalysts facilitating and structuring research. They 
have also served as blinders. The hegemony of these models has at times led 
researchers—or worse, policy makers—to believe mistakenly that these meta- 
phors capture the whole range of social dilemmas or accurately model all em- 
pirical social dilemmas. 

I deal with each of these models and their limitations below as well as with 
other models of social dilemmas that have traditionally received less attention. 
In categorizing social dilemmas, I make the first cut in distinguishing dilem- 
mas that involve only two actors (known as dyadic or two-person dilemmas) 
from social dilemmas involving multiple actors (known as N-person dilem- 
mas, where N is some number greater than two). 


Two-Person Dilemmas 


In 1950 Merrill Flood and Melvin Dresher, scientists at RAND Corporation
in Santa Monica, California, carried out an informal experiment using a new
game they had developed. The game was the simplest possible example of a
social dilemma in that it involved only two people, each of whom faced a single
choice between two options (termed cooperation and defection). Albert
Tucker, a mathematician who was a colleague of theirs, created a story to go
along with the game that involved two prisoners, which subsequently became
known as the Prisoner's Dilemma, the game that launched a thousand studies
(actually, several thousand).

The original story involves two prisoners who are separately given the 
choice between testifying against the other or keeping silent (see e.g. Luce & 
Raiffa 1957). In my classroom I offer a simple example of the game: Two students
are asked to take $1 out of their wallets. Each, in secret, decides whether
to place the money in an envelope (cooperate) or to keep the money in one’s
pocket (defect). Each envelope is then given to the other person, and I double
whatever money has been given. The possible outcomes (in dollars) are seen in
Figure 1a. The game is marked by the fact that whatever choice one’s partner
makes, one is better off defecting (i.e. defecting is a dominating strategy): If
Player II cooperates, Player I's defection brings a payoff of $3 for Player I, and 
$0 for Player II. If Player II defects, Player I is still better off defecting, which 
yields a payoff of $1 for both. Since the payoff structure is identical for both 
actors, they converge on mutual defection even though both would be better 
off if they had cooperated, a move that gives both actors a payoff of $2. In other 
words, it is a deficient equilibrium. This dilemma is at the heart of unsecured 
transactions. For example, when I buy something through the mail, I may be 
tempted to not send a check and the other person may be tempted to not send 
the goods, but if we both defect, we are each worse off than if we had consum- 
mated the exchange. 
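
The classroom version of the game can be reproduced in a few lines of Python. This is only an illustrative sketch: the function names and parameters are hypothetical, and it assumes each student starts with a $1 stake and that whatever is placed in an envelope is doubled before it reaches the partner.

```python
def envelope_payoffs(i_cooperate, partner_cooperates, stake=1, multiplier=2):
    """Return (my_payoff, partner_payoff) in dollars for the envelope game.

    Assumption: cooperating means handing over `stake`, which is
    multiplied by `multiplier` before the partner receives it.
    """
    my_kept = 0 if i_cooperate else stake
    partner_kept = 0 if partner_cooperates else stake
    my_received = multiplier * stake if partner_cooperates else 0
    partner_received = multiplier * stake if i_cooperate else 0
    return my_kept + my_received, partner_kept + partner_received


def defection_dominates(stake=1, multiplier=2):
    """True if defecting pays strictly more whatever the partner does."""
    return all(
        envelope_payoffs(False, partner, stake, multiplier)[0]
        > envelope_payoffs(True, partner, stake, multiplier)[0]
        for partner in (True, False)
    )


if __name__ == "__main__":
    # Print the four outcomes as (Player I, Player II) payoffs.
    for me in (True, False):
        for partner in (True, False):
            print(me, partner, envelope_payoffs(me, partner))
    print("defection dominates:", defection_dominates())
```

Under these assumptions the four outcomes are $2 each for mutual cooperation, $3 and $0 for unilateral defection, and $1 each for mutual defection, matching the payoffs described in the text.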

What defines the Prisoner's Dilemma is the relative value of the four outcomes.
The best possible outcome is defecting while one's partner cooperates
(designated DC). The next best outcome is mutual cooperation (CC) followed
by mutual defection (DD), with the worst outcome being the case in which one
cooperates while one’s partner defects (CD). Thus, in a Prisoner’s Dilemma,
DC > CC > DD > CD.1
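
As a rough illustration, the ordinal condition can be written as a small check. The function name and the optional flag are hypothetical. The flag covers the additional inequality, CC > (CD + DC)/2, that is often included in the definition so that mutual cooperation pays more than taking turns exploiting each other.

```python
def is_prisoners_dilemma(dc, cc, dd, cd, require_no_alternation=False):
    """Check the ordinal condition DC > CC > DD > CD.

    dc: payoff for defecting while the partner cooperates
    cc: payoff for mutual cooperation
    dd: payoff for mutual defection
    cd: payoff for cooperating while the partner defects
    If require_no_alternation is True, also require CC > (CD + DC) / 2.
    """
    ordered = dc > cc > dd > cd
    if require_no_alternation:
        return ordered and cc > (cd + dc) / 2
    return ordered
```

For example, the classroom payoffs (3, 2, 1, 0) satisfy both conditions, whereas payoffs of (5, 2, 1, 0) satisfy the ordering but not the second inequality.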

Two other important games can be created by switching the relative value 
of the outcomes. If mutual cooperation leads to a better outcome than unilateral
defection (CC > DC > DD > CD), the situation is known as an Assurance
Game; an example of this game is shown in Figure 1b. The name comes from
the fact that a person would be willing to cooperate as long as that person were 
assured that the partner would cooperate as well. A common misunderstanding 
is that an Assurance Game presents no dilemma and leads inevitably to mutual 
cooperation. In fact, cooperation is not a dominating strategy, and if the person 
believes the partner will defect, the best the person can do is to defect as well. 
In other words, the Assurance Game has two equilibria: mutual cooperation, 
which is an optimal equilibrium, and mutual defection, which is a deficient 
equilibrium. I may be happy to work with you on preparing a joint report, and 
a report to which we have both contributed may be the best possible outcome 
for me, but if I cannot prepare the report myself and I do not believe you will 
cooperate, I am best off defecting as well. The key issue in the Assurance
Game is whether we can trust each other. This game has received much less attention
than the Prisoner's Dilemma Game, although I argue below that it is a
more accurate model than the Prisoner's Dilemma Game of many social dilemma
situations.

Figure 1 Three two-person games in their ordinal form: I and II designate Players I and II; C
and D designate cooperation and defection. Player I’s outcomes are shown in bold. Nash equilibria
are designated with asterisks.

1There is a second inequality that is also often included as part of the definition: CC > (CD +
DC)/2. This inequality states that mutual cooperation is more profitable than alternating
exploitation of self with exploitation of other.

2Technically, the equilibria I discuss in this section are known as Nash equilibria. A Nash
equilibrium is “any pair of strategies with the property that each player maximizes his or her payoff
given what the other player does” (Ostrom et al 1994, p. 54).

The third game discussed here is created by switching a different pair of 
outcomes in the Prisoner's Dilemma Game. If mutual defection yields a worse 
outcome than unilateral cooperation (DC > CC > CD > DD), we create the
game of Chicken, which can be seen in Figure 1c. It is named after a game of 
dare that was made famous in the 1955 film Rebel Without a Cause. Two 
youths drive their cars toward each other (or in the case of the film, toward a 
cliff). The first youth to turn away is "chicken" and loses face, while the other 
youth basks in the glory of his courage. However, if neither youth turns away, 
they both end by dying—the worst outcome. If both turn away, the sting of be- 
ing chicken is not as great since both drivers lost their nerve. There are two 
equilibria in the Chicken Game: unilateral defection and unilateral cooperation.
If, driving toward each other, you are sure the other person will lose their
nerve and swerve, you are best off driving straight ahead, but if you believe the 
other person will not swerve, you are better off swerving and losing face rather 
than your life. In this sense, you have an advantage in this game if you can con- 
vince the other person that you are crazy, irrational, suicidal, or otherwise inca- 
pable or unwilling to change course. In such a setting the other driver will 
swerve and you will obtain the best possible outcome. 

An alternate interpretation of the Chicken Game is a situation in which each 
person individually has the ability to produce an outcome that will benefit both 
parties, although providing the benefit involves some cost. Whereas mutual 
cooperation is the unambiguous goal for the Prisoner's Dilemma Game and the 
Assurance Game, that is not necessarily the case for the Chicken Game. If one 
person can provide a joint benefit, then it may make no sense for the second
person to duplicate the effort. The problem comes when each person attempts 
to “stare the other down,” each refusing to budge and hoping the other will give
in and cooperate. The key problem then is avoiding a stalemate that results in
the worst possible outcome.3

Note that unlike the Prisoner’s Dilemma Game, neither the Assurance 
Game nor the Chicken Game has a dominating strategy. In the latter two games 
the partner's choice is crucial in determining one's best outcome—one wants 
to match the partner's choice in the Assurance Game and to make the opposite 
choice in the Chicken Game. 
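
The differences among the three games can be checked mechanically. The following Python sketch encodes each game by the rank order of its four outcomes (4 = best, 1 = worst, a numeric encoding assumed here purely for illustration) and derives the dominating strategy and the pure-strategy Nash equilibria from best replies. All names are hypothetical.

```python
from itertools import product

# Ordinal payoffs for one player, keyed by (own move, partner's move).
# 4 = best outcome, 1 = worst; rankings follow the orderings in the text.
GAMES = {
    "Prisoner's Dilemma": {("D", "C"): 4, ("C", "C"): 3, ("D", "D"): 2, ("C", "D"): 1},
    "Assurance": {("C", "C"): 4, ("D", "C"): 3, ("D", "D"): 2, ("C", "D"): 1},
    "Chicken": {("D", "C"): 4, ("C", "C"): 3, ("C", "D"): 2, ("D", "D"): 1},
}
MOVES = ("C", "D")


def best_reply(payoff, partner_move):
    """Move that maximizes one's own payoff against a fixed partner move."""
    return max(MOVES, key=lambda move: payoff[(move, partner_move)])


def dominating_strategy(payoff):
    """Return the move that is best against every partner move, or None."""
    replies = {best_reply(payoff, partner) for partner in MOVES}
    return replies.pop() if len(replies) == 1 else None


def nash_equilibria(payoff):
    """Pure-strategy Nash equilibria of the symmetric game, as (I, II) moves."""
    return [
        (a, b)
        for a, b in product(MOVES, repeat=2)
        if best_reply(payoff, b) == a and best_reply(payoff, a) == b
    ]


if __name__ == "__main__":
    for name, payoff in GAMES.items():
        print(name, dominating_strategy(payoff), nash_equilibria(payoff))
```

Because the games are symmetric, a single payoff table serves both players. Only the Prisoner's Dilemma yields a dominating strategy (defection). The Assurance Game has equilibria where the moves match, and Chicken has equilibria where they differ.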


Multiple-Person Dilemmas 


The first cut in categorizing social dilemmas was distinguishing between two- 
person and N-person dilemmas. Within N-person dilemmas we make the next 
cut, distinguishing between two broad types of multiple-person dilemmas in 
terms of how the costs and benefits are arranged for each individual (Cross & 
Guyer 1980). In the first type, known as a social fence, the individual is faced 
with an immediate cost that generates a benefit that is shared by all. The indi- 
vidual has an incentive to avoid the cost, but if all do so each is worse off than if 
they had managed to “scale the fence.” In the second type, termed a social trap, 
the individual is tempted with an immediate benefit that produces a cost shared 
by all. If all succumb to the temptation, the outcome is a collective disaster.4 
Within each of these broad categories lies a richly developed metaphor that has 
driven research in the area: (a) the provision of public goods (a social fence) 
and (b) the tragedy of the commons (a social trap).5 The potentially noxious
outcomes of both types of social dilemmas stem from what economists refer to 
as externalities, which are present “whenever the behavior of a person affects 
the situation of other persons without the explicit agreement of that person or 
persons” (Buchanan 1971, p. 7). Broadly speaking, externalities are uncom- 
pensated interdependencies (Cornes & Sandler 1996). 


3Poundstone (1992, p. 203) reports an anecdote by Anatol Rapoport that “the Fuegian language
of the natives of Tierra del Fuego contains the word mamihlapinatapai, meaning, ‘looking at each
other hoping that either will offer to do something that both parties desire but are unwilling to do.’”

4Note that this general formulation of social dilemmas rests on learning models rather than on
any assumption of more complex strategic decision-making, as is common in the economic models
to be discussed.

5Other important dimensions along which social dilemmas can be categorized include the
temporal lag between the original action and the eventual disaster (e.g. Messick & Brewer 1983)
and whether the actors in a social dilemma each face an identical incentive structure or not (e.g.
Marwell & Oliver 1993).


PUBLIC GOODS DILEMMAS A public good is a resource from which all may
benefit, regardless of whether they have helped provide the good: I can enjoy
public television whether or not I contribute any money, and I can enjoy the
parks in my city even if I do not pay municipal taxes. This is to say that public
goods are non-excludable and as a result there is the temptation to enjoy the
good without contributing to its creation or maintenance. Those who do so are
termed free-riders, and while it is individually rational to free-ride, if all do so
the public good is not provided and all are worse off. This decision is based on
greed, i.e., the simple desire to obtain the best possible outcome for oneself.
There is also a second reason that can lead to defection: a person may be willing
to cooperate but fear that not enough others will do so to actually provide a
public good. Rather than greed, the concern here is the fear of being a sucker,
i.e., throwing away one’s efforts on a lost cause.
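
The free-rider logic can be illustrated with a small numerical sketch. It assumes a simple linear public goods game of the kind often used in experiments, which the text above does not specify: each actor holds an endowment, contributions are multiplied by a fixed factor, and the total is shared equally among all actors whether or not they contributed. The function name and the parameter values are hypothetical.

```python
def public_goods_payoffs(contributions, endowment=10, multiplier=2.0):
    """Payoff to each actor in an assumed linear public goods game.

    Each actor keeps whatever was not contributed and receives an equal
    share of the multiplied pool, whether or not the actor contributed
    (non-excludability).
    """
    n = len(contributions)
    share = multiplier * sum(contributions) / n
    return [endowment - c + share for c in contributions]


if __name__ == "__main__":
    print(public_goods_payoffs([10, 10, 10, 10]))  # all contribute
    print(public_goods_payoffs([0, 10, 10, 10]))   # first actor free-rides
    print(public_goods_payoffs([0, 0, 0, 0]))      # all free-ride
```

With these assumed values a lone free-rider earns 25 while the three contributors earn 15 each, yet universal free-riding leaves everyone with 10 rather than the 20 each would earn under full contribution.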

Public goods are also distinguished by the fact that they are nonrival 
(Cornes & Sandler 1996) in that one person's use of the good does not dimin- 
ish its availability to another person—my enjoyment of public television does 
not make less of it available to anyone else. A pure public good is completely 
nonexcludable and nonrival, but many public goods exhibit these two qualities 
only to a varying degree. The basic problem was described at least as early as 
1739 by Hume, articulated by Samuelson in 1954, and made famous by Olson 
in 1965 with the publication of The Logic of Collective Action. 

A key characteristic of public goods dilemmas is the relationship between 
the level of resources contributed toward the production of a public good and 
the level of the public good that is provided. This relationship is known as the 
production function (Marwell & Oliver 1993, Heckathorn 1996). Production 
functions can take on any number of forms, but the four basic production functions
shown in Figure 2 can be used to model many of the most important dynamics
in public goods dilemmas.6

With a decelerating production function (Figure 2a), initial contributions 
have the greatest effect, with additional contributions generating increasingly 
diminishing returns. With a linear production function (Figure 2b), each unit
of resource contributed produces the same return. An accelerating production 
function (Figure 2c) produces few returns for the initial contributions but 
brings increasing returns as the contributions increase. Finally, discontinuities 
in the production function, such as the step-level function in Figure 2d, create 
thresholds (also known as provision points). In these cases little or no amount 
of the public good is produced until a certain level is reached, at which point a 
small increase in the level of contributions returns a large and discontinuous 
amount of the public good. In the analysis of public goods, one of the most important
distinctions is whether there are threshold points in the production
function (Ledyard 1995). 
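
The four shapes can be made concrete with a brief sketch. The specific functional forms below (a square root, a straight line, a square, and a single step at 0.5) are assumptions chosen only to illustrate the qualitative shapes. Contributions and the level of the public good are both expressed as proportions between 0 and 1.

```python
import math


def decelerating(x):
    """Early contributions matter most; returns diminish (concave)."""
    return math.sqrt(x)


def linear(x):
    """Every unit contributed produces the same return."""
    return x


def accelerating(x):
    """Early contributions yield little; returns increase (convex)."""
    return x ** 2


def step(x, threshold=0.5):
    """Nothing is provided until the provision point is reached."""
    return 1.0 if x >= threshold else 0.0


def marginal_returns(f, steps=4):
    """Gain in the public good from each successive equal contribution."""
    xs = [i / steps for i in range(steps + 1)]
    return [f(b) - f(a) for a, b in zip(xs, xs[1:])]


if __name__ == "__main__":
    for f in (decelerating, linear, accelerating, step):
        print(f.__name__, marginal_returns(f))
```

Comparing the marginal returns shows the contrast described above: they shrink for the decelerating form, stay constant for the linear form, grow for the accelerating form, and are concentrated entirely at the threshold for the step function.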

6 Some production functions are combinations of these basic forms. A third-order S-shaped
curve, for example, is composed of accelerating, linear, and decelerating components.

Figure 2 Four production functions: (a) Decelerating, (b) Linear, (c) Accelerating, (d) Step
Function

A common misunderstanding is the assumption that all N-person dilemmas
have the structure of an N-person Prisoner’s Dilemma Game. That is, that there
is a dominating strategy that leads to a deficient equilibrium in which no one
cooperates.7 In fact, various production functions can yield N-person versions
of the Assurance Game and Chicken Game as well as the Prisoner's Dilemma
Game.8 Note that if a situation has the structure of an Assurance Game, there is
no temptation to free-ride—the only concern is whether one will be a “sucker.” 

In a Chicken Game, however, the incentive to free-ride can be even more severe
than in the Prisoner's Dilemma Game (Yamagishi 1995).
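
The difference between the three games can be illustrated with their conventional two-person payoff orderings, used here only as a stand-in for the N-person versions. The labels T (temptation to defect against a cooperator), R (reward for mutual cooperation), P (punishment for mutual defection), and S (sucker's payoff), along with the function names, are assumptions of this sketch rather than notation from the passage above.

```python
def classify_game(T, R, P, S):
    """Classify a symmetric two-person game by the ordering of its payoffs."""
    if T > R > P > S:
        return "Prisoner's Dilemma"
    if R > T > P > S:
        return "Assurance"
    if T > R > S > P:
        return "Chicken"
    return "other"


def tempted_to_free_ride(T, R):
    """Defecting against a cooperator pays more than cooperating with one."""
    return T > R


def fears_being_sucker(P, S):
    """Cooperating against a defector pays less than defecting against one."""
    return P > S


if __name__ == "__main__":
    examples = {"PD": (5, 3, 1, 0), "Assurance": (3, 5, 1, 0), "Chicken": (5, 3, 0, 1)}
    for name, (T, R, P, S) in examples.items():
        print(name, classify_game(T, R, P, S),
              tempted_to_free_ride(T, R), fears_being_sucker(P, S))
```

In this sketch the Assurance Game carries only the fear of being a sucker, the Chicken Game carries only the temptation to free-ride, and the Prisoner's Dilemma carries both.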


COMMONS DILEMMAS The second mythic story commonly used in describ- 
ing N-person dilemmas is the tragedy of the commons. Early statements of the 
basic problem can be found in Aristotle (Politics, Book II, Chapter 3).9 The di- 
lemma in its modern form was carefully described by Lloyd in 1832 and made 
famous by Hardin in 1968 when he published his article in Science on the 
topic. Hardin described a group of herders having open access to a common 
parcel of land on which they could let their cows graze. It is in each herder’s interest
to put as many cows as possible onto the land, even if the commons is
damaged as a result. The herder receives all the benefits from the additional
cows, and the damage to the commons is shared by the entire group. Yet if all
herders make this individually reasonable decision, the commons is destroyed
and all will suffer. When timber is harvested faster than it can grow or when
fish are caught in greater numbers than their reproductive capacity, we face
tragedies of the commons. Here again the problem is the non-excludability of a
joint resource, but unlike public goods, a key feature of commons dilemmas is
the subtractability of the benefits (the opposite of being nonrival): The tree I
cut, the fish I catch, and the water I use are not available for others.

7 This belief has led to some misguided critiques of the game theoretic models underlying work
on social dilemmas. For a rejoinder to these flawed criticisms as well as an honest discussion of the
limits of game theoretic models, see Lohmann (1995).

8 Heckathorn (1996) has written a very useful general analysis of different types of games and
how they are the result of various production functions in combination with the relative value of the
public good (note that this analysis concerns dyadic games). See also Schelling (1978) for an
influential discussion and method of modeling different social dilemmas.

9 And later in traditional rhymes, “They hang the man and flog the woman, That steal the goose
from off the common, But let the greater villain loose, That steals the common from the goose”
(quoted in Fairlie et al 1994).

For commons dilemmas, the issue is not the production function but the car- 
rying capacity of the commons, which is a function of its replenishment rate. 
Different resource pools will be renewed at different rates—the reproduction 
rate of a species of fish, the yearly rainfall that adds to groundwater reserves, 
the rate at which pollutants dissipate in the air; this will determine the rate at
which the subtractable joint resource can be appropriated without exhausting 
the commons. 
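
The link between replenishment rate and sustainable appropriation can be sketched with a simple stock model. Logistic regrowth, the parameter values, and the name `simulate_commons` are assumptions made for illustration; real resource pools may replenish in quite different ways.

```python
def simulate_commons(stock, capacity, growth_rate, harvest, periods):
    """Track a renewable stock under a fixed total harvest per period.

    Regrowth is assumed to be logistic: fastest at half the carrying
    capacity and zero when the stock is empty or at capacity.
    """
    history = [stock]
    for _ in range(periods):
        regrowth = growth_rate * stock * (1 - stock / capacity)
        stock = max(0.0, stock + regrowth - harvest)
        history.append(stock)
    return history


if __name__ == "__main__":
    # With capacity 100 and growth rate 0.5, regrowth peaks at 12.5 per period.
    restrained = simulate_commons(50.0, 100.0, 0.5, harvest=10.0, periods=50)
    excessive = simulate_commons(50.0, 100.0, 0.5, harvest=15.0, periods=50)
    print(round(restrained[-1], 1), excessive[-1])
```

With these illustrative numbers, a total harvest below the peak regrowth lets the stock settle at a positive level, while a harvest above it exhausts the commons.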

In sum, public goods dilemmas concern the production of, and commons di- 
lemmas involve the use of, a joint good from which it is difficult to exclude 
others. There are many reasons why excluding others might be costly (Ostrom
et al 1994): the physical nature of the resource (it is difficult to fence in ocean 
fish or exclude tax scofflaws from the benefits of secure national borders); the 
available technology (enclosing huge range lands used to be prohibitively expensive
until the introduction of barbed wire); or existing laws and traditional
norms (which might prohibit anyone being excluded from a commons or
public good). These two basic forms of N-person dilemmas are further distinguished
by the fact that public goods are significantly nonrival, whereas commons
dilemmas involve a subtractable resource.10
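
The two dimensions used in this summary, subtractability and excludability, yield a simple four-way classification. The sketch below treats both as yes/no properties, which is a simplifying assumption (the text describes public goods as "significantly" nonrival, a matter of degree); the function name `classify_good` is hypothetical.

```python
def classify_good(subtractable: bool, excludable: bool) -> str:
    """Place a good in the four-way typology of rivalry and exclusion."""
    if excludable:
        return "private good" if subtractable else "club good"
    return "commons" if subtractable else "public good"


if __name__ == "__main__":
    print(classify_good(subtractable=False, excludable=False))  # public good
    print(classify_good(subtractable=True, excludable=False))   # commons
```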

The move from two-person to N-person dilemmas involves a number of 
profound shifts that affect the dynamics of the game. Dawes (1980) described 
three important ways in which the two-person Prisoner's Dilemma Game dif- 
fered from the N-person version: First, in an N-person dilemma, one's actions 
are not necessarily revealed to others—anonymity becomes possible and an in- 
dividual can free-ride without others noticing her or his actions. In the two- 
person case, each player "knows with certainty how the other has behaved" 
(Dawes 1980, p. 51). Second, the cost one imposes on others from defecting is
focused completely on one's partner in the case of a two-person dilemma, 
whereas it is diffused throughout the group in an N-person dilemma. Finally, in 
a two-person dilemma, each person has significant control over one's partner's 
outcomes and so can shape the partner's behavior in important ways. In con- 
trast, in an N-person dilemma, one may have little or no direct control over the 
outcomes others receive. 

10 A nonrival good from which it is easy to exclude people is known as a club good (see Cornes
& Sandler 1996 for an extended discussion of club goods). Goods that exhibit both subtractability
and excludability are termed private goods.

This list of distinguishing features is a useful starting point, but as we think
about the whole range of social dilemmas, it is important to keep in mind two
significant qualifications. First, anonymity, the diffusion of the harm of defection,
and the inability to significantly impact others’ outcomes are possible
though not inevitable features of N-person dilemmas. Kollock & Smith (1996),
for example, discuss large-scale dilemmas that do not exhibit all of these features.
Second, some of the features identified as characteristic of N-person dilemmas
can be found in two-person dilemmas. For example, in the Prisoner's
Dilemma Game as it is traditionally played in the experimental lab, each player
knows with certainty how the partner has acted, but this is not the case for
many of the two-person dilemmas in our empirical lives. I may promise my 
partner that I will run an errand for her while I am out, decide to take care only 
of my needs, and then claim upon returning that heavy traffic prevented me 
from fulfilling her request. 


SOLVING SOCIAL DILEMMAS 


In this section possible solutions to social dilemmas are considered. These solutions
are divided into three broad categories based on whether the solutions
assume egoistic actors and whether the structure of the situation (“the rules of
the game”) can be changed. Motivational solutions assume actors are not completely
egoistic and so give some weight to the outcomes of their partners.
Strategic solutions assume egoistic actors, and neither of these categories of
solutions involves changing the fundamental structure of the situation. Solutions
that do involve changing the rules of the game are considered in the third
section on structural solutions. I have grouped together both structural solutions
that assume egoistic actors and structural solutions that assume some
weight is given to what others receive. 


Motivational Solutions


Do individuals take their partners' outcomes into account when making a decision?
Many of the models in the literature on social dilemmas assume actors
who are focused only on their own outcomes, but it seems clear that many of us 
do give some weight to what our partner receives. As Dawes (1980, p. 176) 
commented, “Few of us would accept $500 with nothing for our friend in lieu 
of $495 for each of us."

SOCIAL VALUE ORIENTATIONS Research on social value orientations (e.g. 
Kuhlman & Marshello 1975, McClintock & Liebrand 1988) has sought to de- 
termine if there are stable individual differences in "preferences for particular 
distributions of outcomes to oneself and others" (van Lange et al 1992, p. 17). 
Many different social value orientations are theoretically possible, but most 
work has concentrated on various linear combinations of individuals’ concern 
for the outcomes for themselves and their partners. One possibility is that an 
individual might behave so as to maximize joint outcomes (this is described as 
a cooperative orientation in this literature). An individual might also desire to
maximize the relative difference between self and partner (a competitive orientation).
Other orientations include maximizing the partner's outcome without
regard for own outcome (altruism) or maximizing own outcome without
any concern for the partner's outcome (individualism).11 Research in numerous
countries has found that most individuals can be classified as either cooperators,
competitors, or individualists.
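
Because these orientations are described as linear combinations of own and partner's outcomes, each can be represented by a pair of weights. The weights and names below are a minimal sketch under that assumption, applied to the choice Dawes describes between $500 for oneself alone and $495 for each person.

```python
# (weight on own outcome, weight on partner's outcome); illustrative values.
ORIENTATIONS = {
    "cooperative": (1.0, 1.0),      # maximize joint outcomes
    "competitive": (1.0, -1.0),     # maximize own advantage over partner
    "altruistic": (0.0, 1.0),       # maximize partner's outcome
    "individualistic": (1.0, 0.0),  # maximize own outcome
}


def subjective_value(own, other, orientation):
    """Weighted sum of own and partner's outcomes for a given orientation."""
    w_self, w_other = ORIENTATIONS[orientation]
    return w_self * own + w_other * other


def preferred_allocation(allocations, orientation):
    """Pick the (own, other) allocation with the highest subjective value."""
    return max(allocations, key=lambda a: subjective_value(a[0], a[1], orientation))


if __name__ == "__main__":
    options = [(500, 0), (495, 495)]
    for name in ORIENTATIONS:
        print(name, preferred_allocation(options, name))
```

In this sketch cooperators and altruists choose the equal split, while individualists and competitors take the $500.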

This work has shown that individuals with different social value orientations
behave differently when faced with the same objective game (McClintock
& Liebrand 1988, Liebrand et al 1986, Kramer et al 1986, Liebrand 1984).
The researchers in this area have also shown that these orientations are relatively
stable over time (Kuhlman et al 1986).12 These studies make use of
work by Kelley & Thibaut (1978) and Kelley (1979) on matrix transformations.
Kelley & Thibaut argue that individuals often subjectively transform a
given game and play it as if it were another game. There is, after all, no guaran- 
tee that subjects play an experimental game as intended by the researcher—for 
any of a variety of different reasons people might value particular outcomes 
more or less than the immediate objective payoff they receive. 
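
The idea of a subjective transformation can be sketched by reweighting the payoffs of an objective game and checking what the player then prefers. The payoff values (5, 3, 1, 0) and the weighting scheme are illustrative assumptions, not a reproduction of Kelley & Thibaut's analysis.

```python
# Objective payoffs as (row player, column player); illustrative values.
PRISONERS_DILEMMA = {
    ("C", "C"): (3, 3), ("C", "D"): (0, 5),
    ("D", "C"): (5, 0), ("D", "D"): (1, 1),
}


def transform(game, w_self, w_other):
    """Replace each payoff with a weighted sum of own and partner's payoff."""
    return {
        moves: (w_self * a + w_other * b, w_self * b + w_other * a)
        for moves, (a, b) in game.items()
    }


def dominant_move(game):
    """Return the row player's strictly dominant move, or None if there is none."""
    for mine, alternative in (("C", "D"), ("D", "C")):
        if all(game[(mine, theirs)][0] > game[(alternative, theirs)][0]
               for theirs in ("C", "D")):
            return mine
    return None


if __name__ == "__main__":
    print(dominant_move(PRISONERS_DILEMMA))                    # objective game
    print(dominant_move(transform(PRISONERS_DILEMMA, 1, 1)))   # joint-outcome weights
```

With these numbers defection dominates in the objective game, but a player who weights the partner's outcome equally with their own faces a transformed game in which cooperation dominates.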

Work on social value orientations has concentrated on assessing transfor- 
mations that are the results of personality traits. Understanding that some indi- 
viduals routinely give different weights to their own and partner's outcomes is 
an important piece of information in explaining the observed rates of cooperation
in social dilemma situations. However, this knowledge is not very useful
as a solution to dilemmas—this research does not tell us how to increase the 
level of cooperation. 

One possibility would be to study how social value orientations are formed. 
A group of researchers (McClintock & Keil 1983, Toda et al 1978, McClintock 
1974) have studied the development of cooperative and competitive orienta- 
tions in children in several countries. Among other results, they have found 
that competition seems to be learned significantly earlier than cooperation and 
that overall levels of competitiveness can vary from country to country. But 
until the actual mechanisms are identified by which social orientations are 
learned, these insights still do not provide the basis for intervention in a social 
dilemma. Along these lines, a few studies (Frank et al 1993, Marwell & Ames 
1981) have examined the effects of education later in life, examining whether
students in different majors are more or less likely to cooperate. The amusing
outcome is that some evidence suggests that people who study economics are
more likely to free-ride. However, there are questions (mostly by economists)
about how this result should be interpreted and how robust the finding is.

11 Other more exotic orientations are also possible, such as martyrdom and sadism. Note that not
all orientations can be modeled within this framework: another important social value orientation
is minimizing the difference between own and partner's outcomes; this is related to issues such as
equality and justice.

12 Research has also found that stable individual differences exist in how trusting a person is and
that this affects cooperation levels in social dilemma situations (e.g. Yamagishi 1986). Another
individual difference that is noteworthy is the finding that people who cooperate are more likely to
expect others to cooperate as well (Orbell & Dawes 1993).


COMMUNICATION Another approach is to ask what features in the situation or 
environment (rather than in an individual’s personality) affect the weight indi- 
viduals give others’ outcomes. One of the most robust findings in the literature 
is the positive effect of communication on rates of cooperation. Across a wide
variety of studies, when individuals are given the chance to talk with each 
other, cooperation increases significantly (e.g. Orbell et al 1990, Orbell et al 
1988, Liebrand 1984, Edney & Harper 1978b, Dawes et al 1977, Jerdee & Ro- 
sen 1974). While the effect is readily observed, explaining it has been more of 
a challenge. Messick & Brewer (1983) suggested four reasons (which touch on 
both motivational and strategic factors) why communication might increase 
cooperation. First, individuals may be able to gather information about the 
choices others are likely to make. This information, however, can have am- 
biguous effects. If I believe that most other people will cooperate in an N- 
person dilemma, does that give me a reason to cooperate or a greater tempta- 
tion to defect? In part the decision will depend on the structure of the dilemma 
(in an Assurance Game, I will be happy to cooperate if others do) and on one's
social value orientation. Second, communication gives group members the 
chance to make explicit commitments and promises about what they will do. 
However, research has been inconclusive about whether such commitments 
have an effect on cooperation rates (Orbell et al 1990, Dawes et al 1977). 
Third, communication offers an opportunity for moral suasion, i.e., appeals to 
what is the "right" or “proper” thing to do. The effects of moralizing have been 
the subject of very little research, although there are at least some indications 
that it can have a salutary effect on cooperation (Orbell & Dawes 1981). Fi- 
nally, communication may create or reinforce a sense of group identity. This 
last point seems especially important, and the opinion of one of the key re- 
searchers in this area (Dawes 1991) is that the key effects of communication 
come from eliciting group identity.

GROUP IDENTITY The impact of group identity is manifold and profound, 
having effects across all three categories of solutions: motivational, strategic, 
and structural. Indeed, group identity can have such a powerful effect that it 
can influence rates of cooperation even in the absence of communication. Kra- 
mer & Brewer (1984, 1986; Brewer & Kramer 1986) have demonstrated that 
subjects are more willing to exhibit personal restraint in a commons dilemma 
simply as a result of being identified as members of a common group. Intergroup
competition can have even more striking effects. In a study involving
naturally occurring groups, Kollock (1998a,b) uncovered evidence of consistent
transformations of a social dilemma situation, such that a Prisoner's Dilemma
was treated as an Assurance Game when the partner was an in-group
member and as a Prisoner's Dilemma Game when the partner was an out- 
group member. Experimental work by Bornstein and Rapoport (Bornstein et al 
1990; Rapoport et al 1989; Rapoport & Bornstein 1987, 1989; Bornstein & Ra- 
poport 1982) and the classic field experiments of Sherif et al (1961) have 
shown the powerful effect of intergroup competition in promoting cooperation 
within groups. However, this solution can be double-edged. Encouraging or 
creating group competition can serve the needs of group members (and leaders 
and politicians), but the social costs of the conflicts that result between groups 
can be severe. 

Why are individuals more willing to cooperate if they feel part of a group? 
One possibility is that a collective social identity increases the altruism of the 
members. This is certainly a possibility, but something more strategic may be 
happening as well. Indeed, it can be difficult to distinguish apparent altruism 
from subtle long-term strategic considerations. This issue is taken up in the 
next section.13


Strategic Solutions 


Strategic solutions assume egoistic actors and no changes to the structure of 
the game. These approaches rely on the ability of actors to shape the outcomes 
and hence behavior of other actors. For this reason, many of these strategic so- 
lutions are limited to repeated two-person dilemmas. 


RECIPROCITY Far and away the most influential study on strategic solutions 
to social dilemmas is Axelrod’s The Evolution of Cooperation (1984), in 
which he reports the results of a series of computer tournaments investigating 
the two-person Prisoner’s Dilemma Game. While research on the Prisoner’s 
Dilemma Game had gone on for many years prior to Axelrod’s book, it was 
distinguished by its intriguing method, a provocative set of conclusions and 
recommendations, and arresting examples taken from such diverse areas as bi- 
ology and the history of trench warfare. The study centered on two tourna- 
ments in which prominent game theorists (and in the second tournament, com- 
puter hobbyists) were invited to submit strategies for playing the Prisoner’s 
Dilemma in a round-robin contest. 

13 Note also that Kuhlman et al (1986) have argued that social value orientations themselves
might be explained as different strategic responses to the problem of how to maximize one’s own
outcomes.

Axelrod identified three requirements in this environment for there to be
even the possibility of the emergence of cooperation. First, it was essential that
individuals be involved in an ongoing relationship. If individuals met only
once, or equivalently, if this was the last time they would meet, the dominating
strategy to defect in the Prisoner's Dilemma Game would make the pursuit of
cooperation hopeless.14 If the partners would meet again in the future, cooperation
at least has a chance. The second condition is that individuals must be
able to identify each other. The third condition is that individuals must have in- 
formation about how the other person has behaved in the past. If identity is un- 
known or unstable and if there is no recollection or record of past interactions, 
individuals will be motivated to behave selfishly because they will not be ac- 
countable for their actions. 

It was surprising to many at the time that the winner of Axelrod’s two tour- 
naments was the simplest strategy that had been submitted. This strategy, 
named Tit-for-Tat, cooperates on the first interaction and thereafter simply 
does whatever its partner did on the previous round. It has proven to be an ef- 
fective strategy in many different environments, and it has the effect, in es- 
sence, of transforming a repeated Prisoner's Dilemma Game into a repeated 
Assurance Game (Yamagishi 1995; cf. Rapoport 1967). Playing against an in- 
dividual using Tit-for-Tat means that the only long-term possibilities are mu- 
tual cooperation and mutual defection—there is no hope of exploiting this 
strategy in any kind of sustained way. In this sense it can provide a route to sustained
mutual cooperation in a two-person Prisoner's Dilemma Game.
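
The rule itself is short enough to state in code. The sketch below assumes illustrative payoffs of 5, 3, 1, and 0 and a fixed number of rounds; the helper names `play` and `always_defect` are hypothetical and are not drawn from Axelrod's tournament software.

```python
# Payoffs as (player A, player B) for each pair of moves; illustrative values.
PAYOFF = {
    ("C", "C"): (3, 3), ("C", "D"): (0, 5),
    ("D", "C"): (5, 0), ("D", "D"): (1, 1),
}


def tit_for_tat(own_history, partner_history):
    """Cooperate first, then copy the partner's previous move."""
    return "C" if not partner_history else partner_history[-1]


def always_defect(own_history, partner_history):
    """Defect on every round regardless of history."""
    return "D"


def play(strategy_a, strategy_b, rounds):
    """Play a repeated game and return both move histories and total scores."""
    hist_a, hist_b = [], []
    score_a = score_b = 0
    for _ in range(rounds):
        move_a = strategy_a(hist_a, hist_b)
        move_b = strategy_b(hist_b, hist_a)
        pay_a, pay_b = PAYOFF[(move_a, move_b)]
        score_a += pay_a
        score_b += pay_b
        hist_a.append(move_a)
        hist_b.append(move_b)
    return hist_a, hist_b, score_a, score_b


if __name__ == "__main__":
    print(play(tit_for_tat, always_defect, 5))
    print(play(tit_for_tat, tit_for_tat, 5))
```

Against an unconditional defector the strategy is exploited only on the first round and then settles into mutual defection; against itself it cooperates throughout.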

After studying the most successful strategies in the tournaments, Axelrod 
(1984, p. 110) distilled four pieces of advice that he would offer an individual 
playing an iterated Prisoner's Dilemma Game: (a) Don't be envious; (b) don't 
be the first to defect; (c) reciprocate both cooperation and defection; and (d) 
don't be too clever. The key point in his fourth piece of advice was that it was 
important for one's partner to clearly understand what strategy one was using. 
His first piece of advice is essentially an admonition against playing the Prisoner's
Dilemma Game as if it were a zero-sum game, that is, a game in which
one's interests were completely opposed to one's partner's (e.g. chess, com- 
petitive sports, mortal combat). In a zero-sum game, using one’s partner as a 
standard of comparison is useful, as anything that works against one's partner 
necessarily helps oneself. However, trying to beat one's partner or being envious
of their success15 can lead to trouble in a mixed-motive situation such as
the Prisoner’s Dilemma. Trying to beat your partner can be self-defeating if it
results in mutual defection.


14 Axelrod's analysis is based on the logic of game theory. Empirically, one does sometimes
observe cooperation in a one-shot Prisoner's Dilemma Game (e.g. Orbell & Dawes 1993, Hayashi
et al 1997).

15 This can be thought of as negatively weighting one's partner's outcomes, as is the case with
the competitive social orientation discussed above.

A very important lesson from the tournaments was that Tit-for-Tat won not
by beating its partners (indeed, it can only tie or do slightly worse than its part- 
ners), but by doing well on average, encouraging mutual cooperation with 
many of its partners. This seems to be one of the hardest lessons for individuals 
to learn, perhaps because of the dominance of the competitive game as a model 
in many cultures—if the only metaphor you have is the zero-sum game, you 
tend to treat everything as if it were a war. The book spawned a cottage indus- 
try of hundreds of studies that supported, extended, or critiqued the original 
work. Two very useful reviews of research that make use of Axelrod's studies 
are Axelrod & Dion (1988) and Axelrod & D'Ambrosio (1994).
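
The point that Tit-for-Tat does well without ever outscoring a partner can be checked in a toy round-robin. The four-strategy population, the ten-round matches, the payoff values, and the inclusion of a match against one's own twin are all assumptions of this sketch; it is far smaller than Axelrod's tournaments, and, as the text goes on to stress, results of this kind depend on which strategies are entered.

```python
from itertools import combinations_with_replacement

PAYOFF = {
    ("C", "C"): (3, 3), ("C", "D"): (0, 5),
    ("D", "C"): (5, 0), ("D", "D"): (1, 1),
}


def tit_for_tat(partner_moves):
    return partner_moves[-1] if partner_moves else "C"


def always_cooperate(partner_moves):
    return "C"


def always_defect(partner_moves):
    return "D"


def grudger(partner_moves):
    """Cooperate until the partner defects once, then defect forever."""
    return "D" if "D" in partner_moves else "C"


STRATEGIES = {
    "tit_for_tat": tit_for_tat,
    "always_cooperate": always_cooperate,
    "always_defect": always_defect,
    "grudger": grudger,
}


def match(strategy_a, strategy_b, rounds=10):
    """Return the total scores of two strategies over a repeated game."""
    moves_a, moves_b, score_a, score_b = [], [], 0, 0
    for _ in range(rounds):
        a, b = strategy_a(moves_b), strategy_b(moves_a)
        pay_a, pay_b = PAYOFF[(a, b)]
        score_a, score_b = score_a + pay_a, score_b + pay_b
        moves_a.append(a)
        moves_b.append(b)
    return score_a, score_b


def round_robin(rounds=10):
    """Every strategy meets every other strategy and its own twin once."""
    totals = {name: 0 for name in STRATEGIES}
    for name_a, name_b in combinations_with_replacement(STRATEGIES, 2):
        score_a, score_b = match(STRATEGIES[name_a], STRATEGIES[name_b], rounds)
        totals[name_a] += score_a
        if name_a != name_b:
            totals[name_b] += score_b
    return totals


if __name__ == "__main__":
    print(round_robin())
```

In this small population Tit-for-Tat shares the highest total even though it never scores more than the partner it is facing in any single match.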

The success of Tit-for-Tat led some commentators to suggest that this strategy
be used as the basis of everything from childhood education to international
relations. Here again we see the dangers of taking a useful metaphor too
literally, assuming it accurately modeled any situation that even vaguely re- 
sembled a Prisoner's Dilemma. As Axelrod himself appreciated, the results of 
his tournaments depended on both the particular sample of strategies that were 
submitted and the assumptions underlying his study. One of the most impor- 
tant scope conditions of Axelrod's simulations was the assumption of perfect 
information. In a world in which mistakes, misperceptions, and accidents can 
occur, Tit-for-Tat can turn out to be an unsuccessful strategy because it retali- 
ates immediately (Kollock 1993). Strategies that are more generous or forgiv- 
ing than Tit-for-Tat can have important advantages in such settings because 
they avoid the danger of cycles of recrimination that can occur with Tit-for- 
Tat. 
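
A single mistake is enough to show the problem. In the sketch below both players use the same strategy and one of player A's moves is accidentally flipped to defection. Tit-for-Two-Tats (defect only after two consecutive defections) is used here as one example of a more forgiving rule; that choice, the helper name `play_with_slip`, and the eight-round horizon are assumptions of this illustration and are not the specific strategies analyzed in Kollock (1993).

```python
def tit_for_tat(partner_moves):
    """Cooperate first, then copy the partner's previous move."""
    return partner_moves[-1] if partner_moves else "C"


def tit_for_two_tats(partner_moves):
    """Defect only if the partner defected on both of the last two rounds."""
    return "D" if partner_moves[-2:] == ["D", "D"] else "C"


def play_with_slip(strategy, rounds, slip_round):
    """Both players use `strategy`; A's move in `slip_round` (0-based) is
    replaced by an accidental defection. Returns both move sequences."""
    moves_a, moves_b = [], []
    for r in range(rounds):
        a = strategy(moves_b)
        b = strategy(moves_a)
        if r == slip_round:
            a = "D"  # the mistake
        moves_a.append(a)
        moves_b.append(b)
    return "".join(moves_a), "".join(moves_b)


if __name__ == "__main__":
    print(play_with_slip(tit_for_tat, 8, 2))
    print(play_with_slip(tit_for_two_tats, 8, 2))
```

Under Tit-for-Tat the one accident echoes back and forth as alternating defections for the rest of the game, whereas the more forgiving rule absorbs it and mutual cooperation resumes on the next round.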


CHOICE OF PARTNERS Another key assumption of Axelrod's model was the 
network structure of the interacting strategies. In a sense it represented a very 
unusual social structure in which each actor was forced to interact each round 
and to interact with every other possible partner (as this was a round-robin 
tournament). Not playing the game or choosing only some partners with whom 
to interact were not options. One of the most important recent developments 
has been studies that permit players to exit a current relationship and/or choose 
alternative partners. Computer simulations by Schuessler (1989), Vanberg & 
Congleton (1992), and Hayashi and associates (Hayashi 1993, Yamagishi et al 
1994) all found that a very successful strategy in these situations was to coop- 
erate on the first interaction and continue cooperating until the first defection 
from one's partner, at which point the strategy exited the relationship. Hayashi 
(1993) also discovered that a version of this strategy (called Out-for-Tat), 
which incorporated some degree of forgiveness (i.e. a willingness to give a 
partner who had defected before a second chance), was even more successful. 
The conclusion of this work is that the strategy used in selecting one's partner 
can be more important than the strategy that is used in actually playing the
Prisoner's Dilemma Game. Experimental work by the same researchers
(Yamagishi et al 1994) suggests that subjects follow something like an Out- 
for-Tat strategy in which the response to defection is not defection, but deser- 
tion. 
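
The contrast between answering defection with defection and answering it with desertion can be sketched as follows. This is a deliberately simplified illustration: partners are modeled as fixed move sequences, a new partner is always available, exit is costless, and the forgiveness component of Hayashi's Out-for-Tat is omitted. The function names, the partner pool, and the payoff values are all hypothetical.

```python
# Focal player's payoff for (own move, partner's move); illustrative values.
PAYOFF = {("C", "C"): 3, ("C", "D"): 0, ("D", "C"): 5, ("D", "D"): 1}


def defector(round_in_relationship):
    return "D"


def cooperator(round_in_relationship):
    return "C"


def exit_on_defection(partners, rounds):
    """Cooperate with the current partner; after that partner's first
    defection, leave and move on to the next partner in the pool."""
    score, current, age = 0, 0, 0
    for _ in range(rounds):
        partner_move = partners[current](age)
        score += PAYOFF[("C", partner_move)]
        if partner_move == "D":
            current = (current + 1) % len(partners)  # desert this partner
            age = 0
        else:
            age += 1
    return score


def stay_and_retaliate(partner, rounds):
    """Tit-for-Tat locked into a single relationship with no exit option."""
    score, my_move = 0, "C"
    for age in range(rounds):
        partner_move = partner(age)
        score += PAYOFF[(my_move, partner_move)]
        my_move = partner_move  # copy the partner's move next round
    return score


if __name__ == "__main__":
    print(exit_on_defection([defector, cooperator], rounds=10))
    print(stay_and_retaliate(defector, rounds=10))
```

Under these assumptions the player who deserts a defector loses one round and then earns the mutual-cooperation payoff with a new partner, while the player who stays and retaliates is held to the mutual-defection payoff.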


GRIM TRIGGERS All of these studies involved two-person social dilemmas. 
Strategic solutions for N-person dilemmas are much more of a challenge be- 
cause one's own actions may have little or no influence on what others do. One 
possibility that has been explored is the adoption of a "grim trigger" strategy, 
in which each individual agrees to cooperate only on the condition that all oth- 
ers in the group cooperate. In theory, if all adopt this strategy then each per- 
son's decision is decisive and free-riding is impossible. However, experimen- 
tal work by Watabe (1992; Watabe & Yamagishi 1994) found subjects were 
leery of adopting such a risky strategy, and field studies by Ostrom and her col- 
leagues (Ostrom et al 1994) uncovered no instances of groups actually using a 
trigger strategy in their community. 
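
The logic of the trigger, and the risk that makes people wary of it, can be sketched in a repeated setting. The sketch assumes that every member starts by cooperating and then cooperates only if the whole group cooperated in the previous round; the repeated-round framing and the names `grim_trigger_round` and `simulate` are assumptions for illustration.

```python
def grim_trigger_round(previous_moves):
    """Each member cooperates only if every member cooperated last round."""
    everyone_cooperated = all(move == "C" for move in previous_moves)
    return ["C" if everyone_cooperated else "D" for _ in previous_moves]


def simulate(group_size, rounds, deviator=None, deviation_round=None):
    """Return the number of cooperators in each round.

    If `deviator` is given, that member defects once in `deviation_round`
    (0-based), whether by choice or by mistake.
    """
    moves = ["C"] * group_size  # assumed starting point: all cooperate
    cooperators = []
    for r in range(rounds):
        if r > 0:
            moves = grim_trigger_round(moves)
        if deviator is not None and r == deviation_round:
            moves[deviator] = "D"
        cooperators.append(moves.count("C"))
    return cooperators


if __name__ == "__main__":
    print(simulate(5, 6))
    print(simulate(5, 6, deviator=2, deviation_round=2))
```

One defection by one member ends cooperation for the entire group in every later round, which makes each decision decisive but also leaves the group with no way to recover from a single lapse.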


SOCIAL LEARNING A different approach to solving N-person dilemmas has 
been investigated by Macy (e.g. 1993, 1991). His model of decision-making 
does not assume that actors calculate marginal rates of return or work out 
dominating strategies. Basing his work on the principles of social learning theory,
he assumes reward-seeking, penalty-aversive actors and asks under what
conditions such cognitively modest actors might escape social dilemmas. In a 
series of computer simulations, he isolates a number of factors that can pro- 
mote cooperation, including the presence of thresholds and the tendency for 
actors to imitate those around them.


GROUP RECIPROCITY Finally, we return to the issue of group identity and its 
effects. Making group identity salient has been shown to increase cooperation. 
While work in social identity theory (Tajfel 1981) argues that simply catego- 
rizing individuals into a common group is enough to increase their altruism to- 
ward the group, research by Karp et al (1993; see also Jin et al 1996) contests 
this conclusion. The effects of group identity stem, they argue, not from an al- 
truism born of categorization, but from a belief in the interdependencies of 
group members and expectations of reciprocity among the members. In a se- 
ries of studies they carefully removed any possibility or connotation of inter- 
dependency and found that simple categorization was not enough to create in- 
group favoritism. 

It is the belief in future reciprocal exchanges between members, they argue, 
that moderates the temptation to defect and encourages cooperation. The ex- 
pectation of in-group reciprocity seems to serve as a very deep heuristic that 
shapes our strategic decisions (Jin & Yamagishi 1997, Brewer 1981). The expectation
of reciprocity appears to be so great that it sometimes manifests itself
even in situations in which reciprocity is not logically possible (Watabe et al 
1996, Hayashi et al 1997, Karp et al 1993). Further, this heuristic means that 
many Prisoner's Dilemma situations will be transformed into Assurance 
Games. Evidence for this transformation can be found in Watabe et al (1996) 
and Hayashi et al (1997). 


Structural Solutions 


In this section I relax the assumption that the rules of the game cannot be 
changed. Here I examine structural changes to social dilemmas that either 
modify the dilemma or eliminate it entirely. An important issue discussed be- 
low is how these structural changes are provided. 


ITERATION AND IDENTIFIABILITY One approach to structural solutions is to 
create or reinforce those features of the environment that are prerequisites for 
strategic solutions. Returning to Axelrod (1984), this approach suggests three 
changes: (a) Make interaction more durable or frequent; (b) increase identifi- 
ability; and (c) increase information about individuals’ actions. If individuals 
will not interact in the future, if identity is unknown or unstable, and if there is 
no recollection or record of past interactions, individuals will be motivated to 
behave selfishly because they will not be accountable for their actions. Know- 
ing the identity and history of a person allows one to respond in an appropriate 
manner. If information about individuals and their actions is shared among the 
group, this also encourages the development of reputations, which can be a vi- 
tal source of social information and control. These features will be important 
not just for facilitating strategic solutions but also as prerequisites for some of 
the other structural solutions discussed below, notably the use of monitoring 
and sanctioning systems. Along these lines, several studies have found that 
anonymity (the absence of identifiability) lowers rates of cooperation (Fox & 
Guyer 1978, Jerdee & Rosen 1974, Kahan 1973).16

However, it is important to note that ongoing interaction may not always 
have a salutary effect on social dilemmas. Axelrod was concerned with two- 
person Prisoner’s Dilemmas, and in N-person Prisoner’s Dilemmas (with no 
thresholds), there is a stronger temptation to move toward the equilibrium of 
zero cooperation, as defection has a smaller effect and one may not be able to 
impact others’ outcomes and so encourage cooperation. Ledyard (1995) discusses
this general issue and points to a number of studies that have found significant
declines in cooperation over time in N-person dilemmas with no
thresholds (e.g. Andreoni 1988, Banks et al 1988, Isaac et al 1985, Isaac et al
1984, Kim & Walker 1984).

16 Note that anonymity in and of itself may not always have an effect (Kerr 1997). For example,
if individuals do not understand the situation or do not care about the sanctions others impose,
whether one is identifiable or not may not matter.


PAYOFF STRUCTURE As one would expect, numerous studies have demon- 
strated that the greater the personal return from cooperation and the lower the 
return from defecting, the higher the levels of cooperation (Isaac & Walker 
1988, Isaac et al 1984, Komorita et al 1980, Bonacich et al 1976, Kelley &
Grzelak 1972). Perhaps more surprising is the finding that cooperation rates 
increase significantly as the benefits to others from one's cooperation increase 
(Bonacich et al 1976, Kelley & Grzelak 1972, Komorita et al 1980). This ar- 
gues that many people are positively weighting the outcomes of others. 

The nature of the public good and how it is distributed can also have an ef- 
fect. Alfano & Marwell (1980) found that cooperation levels were much 
greater when group members were asked to contribute to a public good that 
was nondivisible. That is, rather than each person getting an individual return, 
the group would receive a lump sum that had to be spent on a group activity. 
The very fact that the public good was indivisible may have helped reinforce a 
sense of group identity and interdependence among the subjects. 


EFFICACY Many researchers have argued that one of the key reasons people 
do not cooperate in an N-person dilemma is the fact that a single person’s ac- 
tions may have no discernable effect on the situation. No one will be fired and 
no program will go off the air if I do not send in a $30 contribution to public 
television, and even if I do conserve water in a drought, it will have no measur- 
able impact on the overall situation. 

If a dilemma is structured in such a way that individuals can have a notice- 
able effect on the outcome—that is, they can make an efficacious contribu- 
tion—cooperation rates can be increased. One way in which this can occur is if 
a public good has a step-level production function. If an individual believes the 
group is close to the threshold, then adding one's own contribution can be 
enough to put the group "over the top" and provide the good. One study (van de
Kragt et al 1983) found that groups who were attempting to provide a public 
good with a threshold designated a subgroup of contributors (via lottery or vol- 
unteering) who would be just enough to provide the good. In this situation each 
person within the minimally contributing set knew that the provision of the 
public good was critically dependent on each of their actions. Free-riding was 
impossible and each knew that their actions were necessary for the success of
the group.17
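
The logic of a critical contribution can be sketched in a few lines. This is an illustration under assumed values, not the design of the study cited: the cost of 5, the value of 10, and the function names are hypothetical.

```python
def step_level_payoff(contributes, others_contributing, threshold,
                      cost=5.0, value=10.0):
    """Payoff to one player when the good is provided only if at least
    `threshold` members contribute (a step-level production function)."""
    contributors = others_contributing + (1 if contributes else 0)
    provided = contributors >= threshold
    return (value if provided else 0.0) - (cost if contributes else 0.0)


def contributing_pays(others_contributing, threshold, cost=5.0, value=10.0):
    """True if contributing pays at least as well as free-riding."""
    give = step_level_payoff(True, others_contributing, threshold, cost, value)
    keep = step_level_payoff(False, others_contributing, threshold, cost, value)
    return give >= keep
```

With a threshold of three, contributing pays only when exactly two others contribute. With fewer the contribution is wasted, and with more it is redundant, which is why designating a minimal contributing set removes the free-riding option.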


17 Other work has shown that increasing the level of the threshold can increase the amount of
cooperation, although this also decreases the probability that the threshold will be met (Isaac et al
1988, Suleiman & Rapoport 1992).
Another experimental study by Bornstein et al (1990) demonstrates the
joint effects of a step-level production function and group identity. The key in- 
novation in these studies (see also Rapoport et al 1989; Rapoport & Bornstein 
1987, 1989; Bornstein & Rapoport 1982) is that two groups are set up in com- 
petition against each other, with a prize going to the group that demonstrates 
the higher level of cooperation; the prize is then distributed equally to the win- 
ning group’s members. This changes the structure of what was originally a 
Prisoner’s Dilemma into a step-level public goods problem in which defection 
is no longer a dominating strategy. The creation of a step-level function (which 
is to say, a threshold point) shifts the structure of the game into an N-person 
version of the Chicken Game. 

Work by Kerr (e.g. 1992) has also shown that cooperation in a public goods 
dilemma is more likely the larger the impact of a person's contribution. A simi- 
lar strategy is used by public television and charities when they create “match- 
ing grants" in which someone agrees to double the contributions that others 
make. 

The perception of efficacy can be enough to affect cooperation. Kerr (1989) 
and Rapoport et al (1989) found a significant relationship between perceived 
efficacy and contributions to a public good. Survey and field research have 
also found that most individuals involved in collective action believed that
their actions had a significant effect on the provision of the public good, even if
the size of the group was very large (Klandermans 1986, Mueller & Opp 1986,
Moe 1980). 

The creation of efficacy, real or perceived, can be an art. Consider the difficulties
of a charity trying to raise money to feed poor children. A potential contributor
may fear that her or his contribution will be wasted or wonder what
good one person can do for an organization that raises millions of dollars. The 
response of at least one charity has been to assign each contributor a specific 
child. The contributor receives a photo and personal information about the 
child they are sponsoring and even an occasional letter from the child or one of 
the parents. The sense of personal responsibility it creates (“what happens if I 
stop contributing?") profoundly changes the decision of whether to cooperate. 


GROUP SIZE Numerous studies have found that cooperation declines as group
size increases (e.g. Komorita & Lapworth 1982, Fox & Guyer 1977, Bonacich 
et al 1976, Hamburger et al 1975). The possible reasons for this effect are 
many. Returning to Dawes's (1980) points about differences between two- 
person and N-person games, increasing group size may spread the harm caused 
by defection, make it harder to shape others' behavior, and make it easier to de- 
fect anonymously. The costs of organizing can also increase as group size 
grows (Olson 1965)—groups can find it harder to communicate and coordinate
their actions. The efficacy and visibility of one's actions can also be diluted,
and monitoring and sanctioning the behavior of others (see below) can
become more of a challenge. This general effect has led some commentators to
argue in favor of anarchistic social systems in which communities are organ- 
ized as networks of small groups (Fox 1985). 

However, none of these effects are inevitable as groups grow in size (Udehn
1993, Kollock & Smith 1996). Interestingly, some experimental work has 
found that the decrease in cooperation as group size increases tapers off 
quickly (Fox & Guyer 1977, Liebrand 1984), and other work has actually 
shown an increase in cooperation with larger groups (Yamagishi & Cook 
1993, Isaac et al 1990). Part of the problem in reaching any precise conclusion 
about the effects of group size is that so many elements can vary as group size 
increases. It is in the end impossible to control for all possible parameters in or- 
der to study a “pure” group size effect (Orbell & Dawes 1981, Ledyard 1995). 
Researchers must decide which parameters are most important and carefully 
control them. Another difficulty with researching this effect thoroughly is that 
one must examine groups of varying sizes, including large groups, and running 
experiments with large groups creates extraordinary logistical difficulties and 
costs. 

One explanation for why larger groups may be more likely to solve social 
dilemmas comes from Marwell & Oliver (1993). They argue that if a public 
good is highly nonrival, a large group is more likely to contain a critical mass 
of individuals whose interests are served by providing the good. One feature of 
a group that encourages the formation of a critical mass is the heterogeneity of 
the group in terms of the diversity of group members’ interests and resources. 
The importance of group heterogeneity in solving social dilemmas is also explored
by Glance & Huberman (1994).


BOUNDARIES This set of structural solutions deals with a core characteristic 
of social dilemmas—the nonexcludability of a joint good. Each of these solutions
attempts to draw some kind of boundary around the collective good.

One of the first solutions proposed for commons dilemmas is the establish- 
ment of an external authority to regulate who had access to the commons or 
how people were to withdraw resources from the commons. This is, in a broad 
sense, Hobbes’ classic solution of Leviathan: People give up some part of their 
personal freedom to an authority in return for some measure of social order. 
This is also the solution Hardin proposed in his famous article (1968) when he 
concluded that "freedom in a commons brings ruin to all." Hardin fully ac- 
knowledged that the outcome might be grossly unfair to some people, but 
given the global tragedy he felt was inevitable, he declared that "injustice is 
preferable to total ruin” (1968). This echoed Lloyd's (1832) grim conclusion 
that “To a plank in the sea, which cannot support all, all have not an equal 
right." A direct example of this strategy can be seen in the establishment of fish 
and game authorities that set strict limits on what can be caught and the length
of the season. A similar approach can be taken in public goods dilemmas,
where an external authority compels individuals to contribute money (as when 
a government collects taxes to provide public services) or labor, such as in the 
case of military conscription. 

The willingness to hand over personal choice to a leader has been shown in 
some experimental studies. Messick et al (1983) and Samuelson & Messick 
(1986a) found that a group that was overusing a commons was willing to
change the structure of the situation by electing a leader who would manage 
the harvesting of the commons for the group. Interestingly, subjects did not 
usually vote for themselves, instead electing a person who counteracted the 
group's performance to that point in time: someone who harvested few re- 
sources from the commons if the group had overharvested, and someone who 
harvested substantial resources if the group had underharvested to that point 
(Messick et al 1983, Samuelson et al 1984). However, Samuelson & Messick 
(1986b) and Rutte & Wilke (1985) found that subjects preferred not to create a 
leader if other structural changes were possible. 

Severe problems can arise in establishing such an authority, as Crowe 
(1969) commented soon after the publication of Hardin’s article. According to 
Crowe, Hardin assumes (a) that the global community can come to an agreement
about what to value and how to rank those values, (b) that authorities will
have sufficient coercive force to compel people to obey, and (c) that authorities 
can be trusted to remain free of corruption and to resist the influence of special 
interest groups. Crowe vigorously questions each of these assumptions and ar- 
gues that even if Hardin's basic presumptions are correct, his solution is un- 
workable on a broad scale. 

Another commonly suggested solution to the tragedy of the commons is to 
privatize the commons, that is, to break the commons up into private parcels on 
the assumption that individuals will take better care of their own property than 
common property. Two experimental studies (Cass & Edney 1978, Messick & 
McClelland 1983) indicate that individuals did better at managing their own 
"private commons" than they did harvesting as a group. However, there are a 
number of difficulties with this solution. First, not all goods can be privatized—it
may be easy to divide up an actual meadow,18 but how does one divide
up schools of fish in the ocean, clean air, or many public goods such as national
defense? Second, even if it is possible to divide up the common good,
doing so raises grave questions about social justice: Who gets the newly pri- 
vatized commons, and how are the parcels allocated? To the highest bidder? In 
a lottery? Third, while it may be reasonable to expect people to take good care 


18 Assuming the meadow is homogeneous, see Ostrom 1990, p 13.
of their own property, empirically there is no question that individuals routinely
destroy their own property. Some have argued that there are "tragedies
of enclosure" (Bromley 1991) just as there are tragedies of the commons. Finally,
private property rights require a great deal of institutional support so that
these rights can be enforced. 

Some of the other assumptions in Hardin's original analysis have also been 
criticized. Notably, he assumes that commons are always open-access, that is, 
that there are no restrictions as to who may use the commons. However, this 
assumption is neither necessary nor historically accurate (Fairlie et al 1994, 
McCay & Acheson 1987). In fact, commons are often surrounded by local 
rules of access and enforcement mechanisms. One of the key findings of field 
research done on how communities manage common property is that groups 
often do find ways to regulate their own actions, and some of these arrange- 
ments have proven to be remarkably robust, lasting across several generations 
(McCay & Acheson 1987, Ostrom 1990, 1992, Ostrom et al 1994). 

Thus, Ostrom (1990) proposes a third route away from the tragedy of the 
commons: the local regulation of access to and use of common property by 
those who actually use and have local knowledge of the resource. Ostrom isolated
a number of design characteristics that were shared by communities that
had a long history of successfully managing common resources. The first char- 
acteristic she discusses deals explicitly with the issue of excludability: Suc- 
cessful communities are marked by clearly defined boundaries— "Individuals 
or households who have rights to withdraw resource units from the [commons] 
must be clearly defined, as must the boundaries of the [commons] itself" 
(1990, p. 91).19

This is not to say that local communities inevitably solve their social dilem- 
mas—there is no shortage of true tragedies as well as victories—but it does 
make the essential point that it is inappropriate to conclude that the only way 
out of a commons dilemma is through the use of some form of Leviathan or
privatization. This has been the conclusion of a number of commentators who 
took Hardin's parable too literally. It is also the case that misguided interven- 
tion by an outside authority can take a bad situation and make it much worse 
(McCay & Acheson 1987).

SANCTIONS As Dawes (1980) pointed out, one of the great challenges of N- 
person dilemmas is that it is often not possible to directly affect others' out- 
comes and so shape their behavior. If the cooperators could be rewarded for 
their action and defectors punished, even large-scale dilemmas might be 
solved.


19 Other design features identified by Ostrom are discussed in subsequent sections. Many of
these characteristics are applicable to public goods dilemmas as well. 
Indeed, one of Olson's (1965) key conclusions was the necessity of using
selective incentives in encouraging cooperation. A selective incentive is a pri- 
vate good given as an inducement to contributing toward a public good. Any- 
one can watch public television, but only subscribers receive program guides, 
discount cards, and other rewards for subscribing. We may all have access to a 
common pool of blood at the blood bank, but at the University of California at 
Los Angeles, only those who do contribute blood receive a half day off with 
pay, food and drink, and occasionally even the chance to win more substantial 
prizes in a lottery. Field research on conservation behavior (Maki et al 1978, 
Winett et al 1978) has shown that selective incentives in the form of monetary 
rewards are effective in decreasing the consumption of water and electricity. 

If carrots work, so do sticks. Experimental studies have shown that coop- 
eration is more likely if individuals have the ability to punish defectors (Cald- 
well 1976, Komorita 1987). Such negative sanctions are the complement of the 
positive sanctions used in selective incentive systems—the target in this case 
is the defector rather than the cooperator. 
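
How carrots and sticks alter the incentive can be illustrated with a small sketch. It assumes a linear public goods payoff with hypothetical values (endowment 10, per-unit return 0.4) and is not taken from the experiments cited. A `bonus` stands in for a selective incentive paid to cooperators and a `fine` for a negative sanction on defectors.

```python
def payoff_with_sanction(contributes, n_other_contributors,
                         endowment=10.0, mpcr=0.4, bonus=0.0, fine=0.0):
    """Payoff when cooperators receive `bonus` and defectors pay `fine`.

    All parameter values are illustrative assumptions.
    """
    contribution = endowment if contributes else 0.0
    pool = contribution + endowment * n_other_contributors
    base = endowment - contribution + mpcr * pool
    return base + (bonus if contributes else -fine)
```

In this sketch cooperation becomes the better choice once bonus + fine exceeds endowment * (1 - mpcr), which is 6 under the assumed values. A reward and a punishment of the same size shift the comparison by the same amount.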

However, implementing sanctioning systems raises two important prob- 
lems. First, there are often significant costs to providing these systems. In or- 
der to reward or punish individuals, one must first be able to monitor their be- 
havior (Hechter 1984). This may be trivially easy if we are working next to 
each other building an irrigation system, or essentially impossible, as when in- 
dividuals in a large city decide to leave the water running in the privacy of their 
home. Even if one is able to keep track of individuals’ actions, there are still 
costs in administering rewards or punishments. The rewards themselves can be 
costly, and administering negative sanctions can require the support of large 
and expensive institutions (e.g. a police force, an internal revenue service). 
Thus, it will sometimes be the case that the costs to monitor and sanction indi- 
viduals will be greater than the benefits that come in terms of higher coopera- 
tion. 

While monitoring and sanctioning costs can be very great, some situations 
exist in which the costs can be made very small through the right institutional 
arrangements (Ostrom 1990). In general, these costs can also be quite modest 
in small groups.?? Ostrom (1990) in particular documents the many ways face- 
to-face communities create local monitoring and sanctioning systems. The 
presence of a monitoring and sanctioning system run by the community mem- 
bers themselves (as opposed to an external authority) was one of the design 
features Ostrom found in each of the successful communities she studied. An- 
other common element Ostrom identified was that cooperative communities 


20 While this is true in many cases, secret defection is possible in even the smallest group, and
there are some situations where monitoring is easily accomplished even in very large groups (e.g.
Kollock & Smith 1996).






employed a graduated system of sanctions. While sanctions could be as severe 
as banishment from the group, the initial sanction for breaking a rule was often 
very low. Community members realized that even a well-intentioned person 
might break the rules when facing an unusual situation or extreme hardship. 
Severely punishing such a person might alienate him or her from the commu- 
nity, causing greater problems. Ostrom also found that even with a well- 
designed internal monitoring and sanctioning system, some conflict was inevi- 
table. Thus, it was important that community members had access to low-cost 
conflict resolution mechanisms. 

The second key problem in implementing sanctioning systems is that these 
systems are themselves public goods because one can enjoy the benefits of a 
sanctioning system without contributing to its provision or maintenance. 
Whether the sanctions are provided by an external authority or locally, there is
the temptation to free ride. The police and judicial system continue to work 
even if I avoid paying taxes, and if everyone else in my community takes on the 
task of informally admonishing and criticizing defectors, I can avoid the costs 
of such actions and still enjoy the benefits they bring. This raises the question 
of when people will cooperate in providing this second-order public good (i.e. 
a public good designed to be a solution to an underlying social dilemma).

The most extensive set of experimental studies on the provision and use of 
sanctioning systems has been by Yamagishi (1992, 1988, 1986). While one
might expect that people’s decisions when faced with a second-order dilemma
mirror their actions when faced with a first-order dilemma, this turns out not to 
be the case. Yamagishi found that trusting individuals (as measured via a scale 
administered prior to the experiment) were likely to cooperate in a first-order
dilemma, but when they were given the opportunity to contribute toward the
provision of a sanctioning system, relatively few did so. In contrast, a group of 
distrustful individuals exhibited low levels of cooperation in the first-order di- 
lemma but were more willing to cooperate in the creation of a sanctioning sys- 
tem. The existence of a sanctioning system led to cooperation rates in the end 
that were similar to those of the trusting individuals.

Why people might be willing to cooperate in a second-order dilemma is an 
area ripe for research. Studies by Heckathorn (1996, 1989, 1988), Axelrod
(1986), and Yamagishi & Takahashi (1994) provide some initial investigations
based on computer simulations that examine the effect of collective sanctioning,
“hypocritical” sanctioning (i.e. sanctions by actors who defect in the
first-order dilemma), and the evolution of traits that encourage sanctioning.


CONCLUSIONS 


The study of social dilemmas is the study of the tension between individual 
and collective rationality. It is the study of tragic (deficient) equilibria caused 
by externalities, that is, uncompensated interdependencies. Social dilemmas
are also a sensitive research domain, in that a great many variables can affect 
cooperation rates, and small changes in these variables can sometimes have 
large effects (Ledyard 1995). 


Studying Social Dilemmas 


One of the great advantages of doing research in social dilemmas is that a well-specified
set of models exists that allows one to capture the key dynamics in a
simple and tractable way in the laboratory. This is also one of the downfalls of 
research in the area—it is perhaps too easy to set up a social dilemma and vary 
any one of an infinite number of variables rather than thinking strategically 
about which situations and which parameters are most important. The Pris- 
oner's Dilemma Game in particular has served as a kind of readily available 
thermometer of cooperation that can be stuck into any situation. There is nothing
to keep a researcher from examining the effects of eye color or pounds of
meat consumed on cooperation rates, and some studies have come close to 
such esoterica. As Messick & Brewer (1983, p. 40) warned us at the end of 
their influential review, “There are more experiments that can be done than are 
worth doing and it is as important as it is tricky to determine which are which." 

Current experimental work has introduced a number of important innova- 
tions in the design of studies. In particular, a number of researchers have in- 
creasingly relaxed the constrained designs of early work. One can now find re- 
search in which actors have the option of leaving the interaction, of choosing 
new partners, and even of choosing the game structures (Kakiuchi & Yama- 
gishi 1997). Other researchers have developed designs that highlight the im- 
portance of the group, examining the effects of inter-group competition or the 
use of exclusion from the group (ostracism) as a sanction (Kerr 1997). 

There are some chronic problems, however, in the manner in which experi- 
mental research is being conducted. Many experiments in this area have used 
trivially small incentives. “It makes no sense,” said Orbell & Dawes (1981), 
"to spend large amounts of money for summer salaries, secretaries, computer 
terminals, and research assistants, and then motivate the subjects with micro- 
scopic amounts of money or course credits." The generalizability of our results 
is limited to the extent we use small incentives, and it is entirely possible that 
many of the inconclusive or contradictory results that are reported in the literature
are due in part to subjects being faced with outcomes that are trivial. It is
also the case that with few exceptions, most of these studies have involved 
very small groups. Our results are thus limited again to the extent that a 10- 
person group is defined as a "large" group. 

Turning briefly to other methodologies, experimental work has been bol- 
stered by studies based on computer simulations that allow one to investigate 
models involving very large groups, explore the logical terrain of one's theories,
study problems that do not lend themselves to analytical solutions, and
develop tenable models that can guide behavioral experiments (for recent col- 
lections, see Liebrand & Messick 1996, Axelrod 1997). Also important in re- 
cent years has been the emergence of a great many field studies based on the 
logic of social dilemmas. These include many studies on resource manage- 
ment as well as fascinating accounts on such topics as the Sicilian Mafia 
(Gambetta 1993) and a study of trespass disputes among ranchers in northern 
California (Ellickson 1991). While one gives up the careful control of experi- 
mental work by moving into the field, one can examine situations involving 
truly large groups and significant (even life-threatening) outcomes. I believe 
the strongest work combines multiple methodologies. Yamagishi and his col- 
leagues (Yamagishi et al 1994, Yamagishi & Takahashi 1994, Yamagishi & 
Yamagishi 1994), for example, have combined simulations, survey research, 
and experimental studies in their research on trust and social dilemmas. Os- 
trom and her colleagues (Ostrom et al 1994) also have a long history of using 
multiple methods, combining field studies with experimental work. 


Future Directions 


In addition to the advances in research design and issues mentioned above, I 
believe especially promising directions for research in the future include work 
on the expectations and effects of generalized reciprocity within groups, the 
transformation of incentive structures, and a greater focus on the Assurance 
Game as a core model in understanding social dilemmas.

A great deal of attention has focused on how dilemmas might be structur- 
ally changed to reduce or eliminate the temptations to defect. But additional 
studies on how the incentive structure in dilemmas can be transformed via mo- 
tivational or strategic means are also crucial. We have seen that there appear to 
be stable personality traits that result in distinct transformations of objective 
payoffs, and work on group identity provides evidence of transformation of 
payoffs depending on group membership. The robust effects of group identity 
and the expectation of reciprocity imply that such issues as the construction of 
group boundaries and the signaling of group membership will be of fundamen- 
tal importance to the study of social dilemmas. Incentive structures can also be 
transformed via strategic mechanisms such as the adoption of a Tit-for-Tat 
strategy in a repeated Prisoner’s Dilemma Game. 
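
The strategic transformation can be sketched as follows. The payoff values (5, 3, 1, 0) and the function names are assumptions chosen only to satisfy the Prisoner's Dilemma ordering, and the comparison partner, a player who always defects, is included purely for illustration.

```python
# Hypothetical payoffs ordered temptation > reward > punishment > sucker.
PAYOFFS = {("C", "C"): (3, 3), ("C", "D"): (0, 5),
           ("D", "C"): (5, 0), ("D", "D"): (1, 1)}


def tit_for_tat(opponent_history):
    """Cooperate on the first round, then copy the opponent's last move."""
    return "C" if not opponent_history else opponent_history[-1]


def always_defect(opponent_history):
    """Defect on every round regardless of what the opponent did."""
    return "D"


def play(strategy_a, strategy_b, rounds=10):
    """Return the total payoffs of two strategies over a repeated game."""
    history_a, history_b = [], []
    score_a = score_b = 0
    for _ in range(rounds):
        move_a = strategy_a(history_b)
        move_b = strategy_b(history_a)
        pay_a, pay_b = PAYOFFS[(move_a, move_b)]
        score_a += pay_a
        score_b += pay_b
        history_a.append(move_a)
        history_b.append(move_b)
    return score_a, score_b
```

Under these assumptions, ten rounds between two Tit-for-Tat players yield 30 points each, whereas a constant defector facing Tit-for-Tat earns 14. Defection still wins any single round, but it no longer pays over the repeated game.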

It is noteworthy that the result of many of these transformations is the fram- 
ing of the social dilemma as an Assurance Game. Working within an Assur- 
ance Game does not eliminate the challenge of cooperation, but it does change 
our focus in many ways. Trustworthiness, trustfulness, and all those factors 
that influence these concepts become even more important (cf Yamagishi & 
Yamagishi 1994). Further, because the key issue in these dilemmas is the assurance
that others will cooperate, attempts to signal and advertise one’s commitment
to cooperate will be critical. This might be as simple as a public
pledge to cooperate or an act that is more symbolic.21 In this sense, signs that
one is committed to a group or to a particular goal would be important in en- 
couraging others to cooperate (e.g. wearing a crucifix, a lapel pin from a frater- 
nal organization, gang colors, a union pin). More broadly, using the Assurance 
Game as one's model makes signaling and signal detection (or in the language 
of social psychology, dramaturgy and attribution) centrally relevant to a study 
of human cooperation.22

Transformations can also be important because they provide another poten- 
tial path to solving social dilemmas. Rather than trying to solve the dilemma as 
it exists, it may be easier to work to transform the dilemma to, e.g., an Assur- 
ance Game and then use another set of more viable strategies to encourage co- 
operation (Kollock 1998a, Yamagishi 1995). Bornstein et al’s (1990) research 
in which an N-person Prisoner’s Dilemma Game is transformed into a Chicken 
Game via inter-group competition is an example of such an approach. 

Ideally, we should pursue experimental designs that permit large groups, 
sizable incentives, and diverse populations. One possible approach to these 
challenges is to move away from physical group laboratories to experimental 
systems that are designed to make use of the many advantages of the Internet 
(Macy et al 1997). An experimental system based on the World Wide Web 
would make it much easier to run studies involving very large groups that are 
composed of more than just college undergraduates. Such a Web-based lab 
would also permit cross-national experiments, which would enable studies of 
cross-cultural interactions. One could even make use of monetary exchange 
rates in order to run experiments in countries where the incentives offered 
would be truly significant. 

Uniting all of these studies is a core set of social dilemmas that can be explicitly
and precisely modeled. This core set of models can serve as a kind of
lingua franca for communication between disciplines. Researchers with very 
different goals and methodologies can map the results of each other's work 
onto their own through the use of these models. However, this potential is of- 
ten not realized, and it is often the case that scholars in different disciplines re- 
main unaware of each other’s work. One of the most important goals for future 
work is for researchers to become more aware of related literatures in neigh- 


21 An excellent example of these processes is given by Fantasia (1988) in his case study of a
wildcat strike. 

22 Note that dramaturgical issues are very important in a Chicken Game as well as in an
Assurance Game, but for different reasons. In a Chicken Game one can try to convince the other
that one will certainly not cooperate in hopes of forcing the partner to do so.
boring disciplines. In particular, I would encourage closer ties with experimental
economists, who have produced a very useful body of work on social dilemmas
(for a starting point, see Kagel & Roth 1995). We should work toward the
integration of these various research traditions and the future collaboration of 
experimental social scientists across the disciplines of sociology, psychology, 
economics, political science, and anthropology. 
ABSTRACT


Historically, breakdown theory dominated the sociological study of collec- 
tive action. In the 1970s, this theory was found to be increasingly unable to 
account for contemporaneous events and newly discovered historical facts. 
Resource mobilization theory displaced breakdown theory as the dominant 
paradigm. Yet the evidence against breakdown theory is weak once a distinction
is made between routine and nonroutine collective action. Several recent
contributions affirm the explanatory power of breakdown theory for nonrou- 
tine collective action. Breakdown theory also contributes to an understand- 
ing of the use of governmental force against protest and of the moral features 
of collective action. Breakdown and resource mobilization theories explain 
different types of phenomena, and both are needed to help account for the 
full range of forms of collective action. 


INTRODUCTION 


Breakdown theory is the classic sociological explanation of contentious forms 
of collective action such as riots, rebellion, and civil violence. The crux of the 
theory is that these sorts of events occur when the mechanisms of social control
lose their restraining power. Breakdown theory was expressed in the mainstream
of sociology by its standard-bearers: Comte through Durkheim, Gustave
LeBon, and Gabriel Tarde in the European tradition; Robert Park and his
student Herbert Blumer, Talcott Parsons and his student Neil Smelser in the 
American tradition. 
Breakdown theory's dominance ended during the 1970s. Researchers
claimed that breakdown theory could not account for societal events as they 
were then unfolding—the social movements and collective violence of the 
1960s and 1970s, nor for newly collected historical data. A new theory 
emerged to explain the anomalies. Resource mobilization (RM) theorists pos- 
ited that collective action flows not from breakdown but from groups vying for 
political position and advantages. Rebellion is "simply politics by other 
means," stated William Gamson (1990 [1975], p. 139). 

Resource mobilization theory gained quick acceptance, becoming by 1980 
the dominant paradigm (Zald 1992, p. 327). Breakdown theory fell so far that a 
study could be criticized by merely pointing out that it left variation to be ex- 
plained by breakdown processes. As an example, Anthony Oberschall (1973, 
p. 120) classified collectivities to greatly emphasize organizational ties within 
a community: whether they are organized along traditional/communal lines, or 
have little or no organization of any kind. He argued that in a segmented social 
structure (that is, the lower orders are not effectively integrated with the elite), 
collective action may arise under all three forms of horizontal organization. 
However, when the horizontal ties are weak or nonexistent, we should expect 
"short-lived, but violent... outbursts devoid of leadership, organization, and 
explicitly articulated goals" (Oberschall 1973, pp. 122-23). 

Charles Tilly (1978, p. 83), one of the architects of resource mobilization 
theory, portrayed this as an escape clause and implied that Oberschall had not 
emphasized strongly enough the influence of prior organization. Tilly would 
later affirm that 


solidarity, rather than insufficient integration, provides the necessary
conditions of collective action, and... rebellions, protest, collective
violence, and related forms of action result from rational pursuit of shared
interests (1984, pp. 51-52).


As used by Tilly and others, solidarity refers to dense social networks and a 
strong collective identity. Breakdown (malintegration) refers to weak
networks and a diffuse collective identity often created by chronic
unemployment, family instability, and disruptive migration.

Recent scholarship, however, suggests a need to reconsider RM theory's
monopoly over the field. The thesis of this essay is that breakdown theory and
RM theory analyze different phenomena, and that the sociological terrain 
needs to be opened up for breakdown theorists’ insights. 


RM RESEARCH: THE NEGATION OF THE NEGATION?


Few resource mobilization theorists have backed away from the claim that 
breakdown theory has been falsified by the evidence. The key research said to 
refute breakdown theory can be divided into (a) the work by Charles Tilly and his
collaborators on collective action in a European setting; (b) studies of the
organizational bases of social movements; (c) analyses of the urban disorders of
the 1960s; and (d) work on the connection between collective action and 
crime. 


Collective Action In European Setting 


The work of Charles Tilly and his colleagues has been pivotal in the swing 
away from breakdown theory. Tilly's group undertook a daunting task. They 
counted the incidence of collective action in several European countries over a 
100-year period, against which they sought to test breakdown and RM models. 
Few would challenge the enormous contributions (methodological, historical, 
and theoretical) made by this impressive body of research. Yet pace Tilly, the 
refutation of the breakdown theory is not one of them. 

That Tilly's research is not a crucial test of breakdown theory has been ar- 
gued by Frances Piven and Richard Cloward (1992). To back up for a moment, 
in his theoretical work, Tilly (1978, pp. 14-15, 23) recognizes that the heart of 
breakdown theory is a distinction between "routine collective action" (e.g. 
electoral rallies, peaceful protest) and "nonroutine collective action" (e.g.
rebellion, collective violence, riots). The difference, to expand on this point, is
not so much that participants in riots and rebellion injure people and destroy 
property. Boxers, armies, and wrecking crews are paid to do physically similar 
things. Rather, it is that participants must free themselves from the restraints 
on behavior, both moral and physical, that society normally makes strongest 
and is most concerned about. Breakdown theory posits that only nonroutine 
collective action flows out of breakdown processes; routine collective action is 
said to arise from and reinforce solidarity. 

Yet when Tilly and associates measured the incidence of collective action, 
the routine/nonroutine distinction was lost. In one study, Shorter & Tilly 
(1974) sought to explain the timing of strikes in France over the period 
1865-1965. Piven & Cloward (1992, p. 306) point out that strikes became
legal in France in 1865 and thus presumably were more akin to routine than to
nonroutine collective action. Shorter & Tilly made no additional effort to
distinguish between "routine" and "nonroutine" strike activity. The same
problem occurs in the other key studies claiming to disprove breakdown theory,
including those by Lodhi & Tilly (1973) and Snyder & Tilly (1972).

Piven & Cloward (1992, p. 306) maintain that the flaw is fatal: "Taken as a
whole, this corpus of research does not answer the question of the conditions 
under which ordinary people do in fact resort to violence or defiance, and the 
findings cannot be taken to refute the [breakdown] perspective." This criticism 
has gone unrebutted and is, in my opinion, fair. 

Furthermore, the empirical case in favor of resource mobilization as a
replacement for breakdown theory is also problematic. I focus on one study
because it raises a theoretical issue discussed below. In building the RM model,
Snyder & Tilly (1972) argue that high levels of governmental repression 
should increase the cost of collective action and thus reduce the likelihood that 
groups will be able to mobilize and make demands. To test this hypothesis, 
they regressed the number of collective action participants for each year in 
France between 1830 and 1960 on three separate indicators of repression: (a) 
size of the national budget, measuring the "bulk" of the government; (b) 
person-days in jails for each year, including for ordinary crime; and (c) "excess
arrests," a five-year lagged variable measuring whether, in a given year, there
were more arrests than would otherwise be expected from the number of
collective action participants in that year and the overall pattern of arrests
and participants for the entire 131-year period.

The regression coefficient for the national budget is in the predicted direc- 
tion; the one for person-days in jails is also in the predicted direction but weak; 
and the one for excess arrests is in the opposite of the predicted direction and 
weak. One could argue (although Snyder & Tilly do not) that excess arrests is 
the only direct measure of governmental repression; the other two are so dis- 
tant from government repression as to be irrelevant. If so, the results seem to 
suggest not only that repression does not work, but that it may slightly increase 
collective action. In fairness, though, Snyder & Tilly note that the three meas- 
ures of repression are less than ideal, each presenting “some difficulties" 
(1972, pp. 527-28). 

The point here is the tenuous nature of evidence assembled by the Tilly 
group with regard to the causes of nonroutine collective action. The point is not 
trivial, given the impact of the research on the field.


Secondary Groups and Social Movements 


RM theorists argue that preexisting organization, both formal and informal, fa- 
cilitates collective action. Organization provides resources, such as pooled la- 
bor and leadership; it schools participants in civic cooperation and public 
mindedness, and extends the interpersonal bonds through which recruitment 
takes place. Resources, public spiritedness, and social bonds, in turn, help 
make possible the hard work often needed to sustain collective action. Organi- 
zation also permits the “bloc” mobilization of preexisting groups directly into 
movements. 

The finding of a positive association between organization and collective
action has been replicated in dozens of studies and is irrefutable, at least in
regard to certain forms of collective action. These forms include (a)
community-based protest movements, such as those over abortion (Luker 1984,
Staggenborg 1991), property taxes (Lo 1990), and school busing to achieve
racial integration (Useem 1980); (b) elite-supported protest movements, such as
Mothers Against Drunk Driving (McCarthy et al 1988, Weed 1987); and (c) social
movements employing disciplined civil disobedience, such as the US civil
rights movement (McAdam 1986, Morris 1984). Yet this research contradicts
breakdown theory only if the collective action being studied falls into the
nonroutine category.

While no one has been able to define exactly what “routine” and “nonrou- 
tine" mean, some cases are easily classified. By almost anyone's standard, the 
riots that occurred in US cities in the 1960s were nonroutine. They entailed 
massive looting, widespread violence, and direct defiance of orders by law en- 
forcement agencies to disperse. If the evidence on these disturbances runs 
against breakdown theory, the position is damaged. (This evidence is reviewed 
in the next section.) 

Other collective action noted above just as clearly belongs to the routine 
category. Mothers Against Drunk Driving, for example, looks a lot like the 
other pluralist interest groups that routinely vie for political influence. 
MADD’s tactics include public and youth education, lobbying for tougher 
laws against drunk driving, assisting victims of drunk drivers, and urging po- 
lice to beef up their enforcement efforts. Breakdown and RM theorists alike 
would anticipate that MADD would be a product of something other than so- 
cial breakdown. The evidence bears this out. For example, Frank Weed (1987, 
pp. 264-65) reports that, based on a 1985 survey of local chapter officers, the
typical MADD activist was 41 years of age, married, had attended college, 
held a high-status job, and was involved in one or more other community or- 
ganizations. John McCarthy and colleagues (1988) found that counties with 
more affluent and highly educated residents were more likely to have MADD 
chapters than counties with less affluent, less educated residents. 

There are also hard cases. Eric Hirsch (1990) tested breakdown and re- 
source mobilization theory against data on student involvement in a 1985 
anti-apartheid protest movement at Columbia University. On the one hand, the 
students' defiant tactics—they blockaded the administration building for three 
weeks demanding that the University sell its stock in companies doing busi- 
ness in South Africa—suggest nonroutine collective action. On the other hand, 
these were students at one of the country's elite universities, far from the sort 
of setting or population that breakdown theorists have in mind. 

Indeed, Hirsch's quantitative data, derived from a survey of undergradu- 
ates, seem to support a middle-case interpretation. Hirsch regressed a measure 
of protest participation on ten independent variables. Five of the ten measured 
either general ideological orientation or attitudes specific to the situation itself, 
such as the belief that divestment would influence the South African govern- 
ment. None of these five variables appears to have a direct bearing on the 
breakdown and RM positions. The other five independent variables do. They
include membership in campus political action organizations, membership in 
other types of campus organizations, and a dummy variable for freshman 
status. 

All five of the general variables have a significant impact on participation,
together explaining 59% of the variance. None of the five breakdown/RM
variables has a significant regression coefficient. In other words, student
protestors were neither more nor less likely than other students to be members
of a campus political action organization, of any other type of campus
organization, or of the freshman class. If the results can be interpreted as failing to
support the breakdown theory, as Hirsch (1990, p. 251) argues, so too for the 
RM position. 

In sum, resource mobilization theorists have demonstrated that prior or- 
ganization has a great deal to do with the ability of a group to act. Intuitively it 
makes sense that routine social movements draw on the resources embedded in 
their communities such as trust, information, and skills in civic participation. 
But that does not exhaust the question of what conditions generate nonroutine 
collective action. 


Revisionist Studies of Urban Riots of the 1960s


Resource mobilization theorists commonly assert that the same forces that 
generate social movements are also responsible for the outbreak of urban
collective violence. The Tillys (1975, p. 290) state that "no matter where we look,
we should rarely find uprooted, marginal, disorganized people heavily in- 
volved in collective action." This expectation, they continue, is borne out by 
the evidence on the urban riots of the 1960s (1975, pp. 291-94). If this
argument holds up, the breakdown position is damaged.

Research on the urban riots of the 1960s took one of two approaches. Some 
studies explained riot participation by determining the characteristics that
distinguished rioters from nonrioters. Other studies compared cities that had
riots to those that did not.


STUDIES USING INDIVIDUAL-LEVEL DATA The most important individual- 
level data were collected in connection with the National Advisory Commis- 
sion on Civil Disorders (NACCD 1968a,b). The data sources included: (a) a 
survey conducted in the first three months of 1968 in 15 cities under the
direction of Angus Campbell and Howard Schuman; (b) surveys conducted in
Newark and Detroit after major riots in those cities in 1967; and (c) police
records of individuals arrested during the riots in a number of cities.

The Campbell/Schuman (NACCD 1968b, p. 12) survey obtained data on
2800 African Americans in 15 cities. While too few respondents admitted to
riot participation to allow for analysis (1968b, p. 54), the questionnaire did
include a number of items concerning attitudes toward rioting and other types of
collective action. Abraham Miller and colleagues (1976) have reanalyzed 
these data in a way that is particularly useful for present purposes. 

Miller distinguished among four groups: apathetics, nonviolent protestors, 
violence-prone protestors, and the riot prone. He found that "violent
protestors" were a "hybrid" group, with very mixed attitudes and demographic
characteristics. For this reason, Miller focused on two "pure" groups: nonviolent
protestors (seemingly akin to actors engaged in routine collective action) and 
the riot prone (seemingly akin to actors engaged in nonroutine collective ac- 
tion). 

The data indicate that riot-prone respondents, compared to the nonviolent 
protestors, tended to have relatively low levels of education and income, to be 
socialized in a broken home, to be unmarried, and to be relatively young 
(Miller et al 1976, p. 361). For example, 71% of the respondents at least 44
years of age and in the highest occupational category were protestors, but less
than 3% of the same group were rioters. In contrast, 22% of the 16-19-year-olds
who had unskilled occupations were rioters, but only 15% of this group
were protestors (Miller et al 1976, pp. 362-63). Miller concludes,


If such measures as being married, being reared in an intact family, and
attaining the relatively upper rungs of occupation, income, and education
ladder can be justifiably considered as measures of social integration then it is
clear that rioters... emerged from the least socially integrated and lower
elements of the community (Miller et al 1976, p. 361).


These findings are consistent with a breakdown explanation of nonroutine col- 
lective action. 

The surveys of residents in Detroit and Newark yielded results less
favorable to breakdown theory. The Detroit sample was
interviewed two weeks after a major riot in that city; the Newark sample about
six months after the riot there. From these data, the National Advisory Com- 
mission reported that “there are no substantial differences in unemployment 
between the rioters and the noninvolved” (NACCD 1968a, p. 75). 

While this finding continues to be widely cited as evidence against break- 
down theory (e.g. Skolnick & Fyfe 1993, p. 76), it should be regarded cau- 
tiously. First, the tables have small N's: 154 and 189 for the Detroit and New- 
ark samples, respectively. For the Detroit sample, there were only 27 rioters in- 
terviewed, to be distributed into the employed or unemployed cells; for the 
Newark sample, 84 rioters were distributed to the two cells. 

Second, the Newark respondents (but not the Detroit respondents) were 
asked if they had been unemployed for at least a month during the previous 
year. Fifty-four percent of the rioters, compared to 37% of the nonrioters, 
stated that they had been so unemployed, a difference that is statistically
significant. If the rioters were more likely than nonrioters to have been
unemployed in the previous year, it is not clear why this is not reflected in rates of
current unemployment. 

Third, the overall levels of unemployment were “extremely high” 
(NACCD 1968a, p. 75)—about a third of both samples. Apparently, Detroit 
and Newark were communities in which work had already begun to
"disappear" (Wilson 1996). One could argue that these high levels of
unemployment made rioting more likely in those communities, compared to
communities with full employment, even if unemployed individuals were no more
likely to riot than were employed individuals. This community-wide effect of
unemployment should show up in city-level data. It does, as we see in the next 
section. 

The Commission (NACCD 1968b, p. 247) also reports data from nine cities 
on the proportion of riot arrestees who had been previously arrested. This
proportion ranged from 39% (Buffalo, New York) to 100% in two cities (New
Brunswick and Elizabeth, New Jersey). The Commission discounts the signifi- 
cance of these findings, pointing out that “50 to 90 percent of the Negro males 
in the urban ghettos have criminal records" (1968b, p. 237). From this and 
other considerations, the Commission concludes that “the criminal element is 
not over-represented among the rioters" (1968b, p. 237). This conclusion has 
also been widely cited by resource mobilization theorists as part of the readily
available evidence against breakdown theory.

Still, these findings can be reconciled with breakdown theory. If there is any 
realism to the upper-end figure of 90%, this would suggest that prior criminal- 
ity could not predict which inner-city residents rioted (because there is little 
variation), but it might still be able to explain why riots occurred there and not 
elsewhere. One could argue that in areas in which the arrest rates are as high as 
90%, the stigma of arrest loses its inhibiting effect, including for the criminal
act of rioting. This remains speculative.

STUDIES USING CITY-LEVEL DATA The most influential research using the city
as the unit of observation was reported in a series of papers by Seymour 
Spilerman (1970, 1971, 1976). Spilerman found that only two city-level char- 
acteristics were significantly related to riot occurrence and intensity: (a) the 
numerical size of the African-American population, and (b) location outside 
of the South. All other community characteristics, including unemployment, 
education, and income, had no independent effect. Spilerman argues that the 
factor most responsible for riots, overshadowing any effects of community 
conditions, was the “widespread availability of television and its network news 
structure" (1976, p. 790). Television brought scenes of the civil rights move- 
ments into “every ghetto.” This, in turn, “contributed in a fundamental way to 
the creation of a black solidarity that would transcend the boundaries of
community."

The theoretical significance of these findings for breakdown theory, as well 
as the findings themselves, remain open to challenge. First, with regard to the 
region effect, Piven & Cloward (1992, p. 312) argue that rioting may have
occurred outside the South because "northern ghettos were less cohesive than
southern black communities." This does not show up in the analysis because
"cohesiveness" is not measured directly by Spilerman. The same point could
be made about the effect of black community size: large black communities, 
compared to smaller ones, may have had weaker social controls, especially un- 
der conditions of high unemployment. Additional evidence on these points is 
needed. 

Second, Spilerman adduces no evidence that television had the impact he 
said it had, that is, to create greater solidarity among inner-city African
Americans. His argument is plausible, but not self-evident, especially in the
absence of an independent measure of solidarity. An indirect measure is the crime rate.
If black solidarity was increasing in this period, there should have been a corre- 
sponding decline in the crime rate. Yet the crime rate for African Americans 
increased rapidly in this period (LaFree & Drass 1996, p. 622). Even if we 
were to assume that television was a key independent variable, the causal 
mechanism may differ from the one Spilerman identifies. Robert Putnam 
(1995) argues that television viewing (which is concentrated among the less 
educated sectors) fragments communities and reduces solidarity. Perhaps that 
effect was present too, or instead. 

Finally, Spilerman's key empirical finding—that community conditions 
were irrelevant to the outbreak of violence—has been challenged. In one 
study, Susan Olzak and Suzanne Shanahan (1996, p. 946) found that "cities 
with higher rates of unemployment for blacks had significantly higher rates of 
unrest," at least under conditions of heightened inter-minority competition. In 
another study, Daniel Myers (1997, p. 110) reported that the number of non- 
whites unemployed in a given city had a strong effect on riot rates, although he 
cautions that further research is needed to determine "exactly how
unemployment contributes to civil unrest." These two studies, relying on data that are
more complete than were available to Spilerman, help shift the burden back to 
the critics of breakdown theory. 


Collective Action, Crime, and Age 


From its founding, breakdown theory has hypothesized links between
riots/rebellion and other signs of breakdown, such as crime. One strategy to test this
link has been to examine whether rates of collective action and rates of crime
track together over time. The Tillys (1975) found that crime and collective
action varied independently in France, Italy, and Germany from 1830 to 1930. In
contrast, Ted Gurr (1976) established that crime waves tended to occur during
periods of high levels of civil strife in the four areas he studied (London,
Stockholm, New South Wales, and Calcutta).

In a more recent study, Gary LaFree and Kriss Drass (1997) examined the 
covariation between rates of crime and collective action in the United States 
over the period 1955 to 1991. They found that crime rates and collective action 
rose in tandem from 1955 to the early 1970s but, after that, crime rates and col- 
lective action were negatively related. 

LaFree & Drass (1997, p. 849) argue that the first half of the time series sup- 
ports breakdown theory: Crime and collective action seem to “spring in part 
from the same social forces." The second half of the time series, though, is 
more consistent with RM theory, the authors maintain. They argue that the two 
rates diverged because collective action, but not crime, requires organizational
resources, which ostensibly became tighter in the post-1974 period. Also, 
beefed-up law enforcement had a stronger effect on riots than on crime. Addi- 
tional work needs to be done on the first point, because LaFree & Drass do not 
show there was an actual decrease of resources. “Resources,” of course, must 
be measured independently of the incidence of collective action.

Another strategy would be to examine the pattern of correlations between 
(a) crime and collective action (routine and nonroutine) and (b) other relevant
variables such as age. The premise here is that like phenomena correlate in like
ways.

It is ancient criminological wisdom (supported by a large body of evidence) 
that young people commit crime at relatively high rates. For most crimes, rates 
of commission peak in the late teen years and then fall to half their peak level
by the mid-twenties (Blumstein 1995, Gottfredson & Hirschi 1990, pp.
123-26). If crime shares properties with rioting, but not routine collective
action, we would anticipate that age would correlate with rioting in a similar way
but not with routine collective action. This is what we find.

From its multiple data sources, the 1968 National Advisory Commission on 
Civil Disorders concluded that rioters were predominantly “late teenagers or 
young adults" (NACCD 1968b, p. 74). For example, in the Detroit survey 
noted above, 61% of the self-reported rioters were between the ages of 15 and 
24, and 86% were between 15 and 35. Moreover, this correlation appears to be 
stable over time. A study of the rioters arrested in the 1992 Los Angeles riots 
found that their age distribution was almost identical to the age distribution of
those arrested in the 1965 Los Angeles riot (Petersilia & Abrahamse 1994, pp.
140-44). This suggests that the impact of age on rioting is a life-cycle effect
(that is, rioting is something that people do when they are young but not older)
rather than a period or generational effect.

In contrast, a similar pattern of correlations is not found between age and
participation in routine forms of collective action such as voluntary organiza- 
tions and voting. Affiliation with voluntary organizations is greatest among 
the group between the ages of 30 and 49, somewhat lower for those over 50, 
and much lower for the group 17-29 (Verba & Nie 1972, p. 181). And,
controlling for additional demographic variables, age increases the likelihood
of voting (Wolfinger & Rosenstone 1980, p. 47). Of course, many phenomena are
correlated with age (playing volleyball, for example) but have nothing to do 
with crime. Still, the point is that crime and nonroutine collective action seem 
to share a similar place in the life-cycle, one not shared by routine collective 
action. 


RECENT ADVANCES 


The resource mobilization consensus notwithstanding, a number of recent 
studies have contributed to the development of the breakdown position. The 
most fully developed of these, by Frances Piven and Richard Cloward (1977), 
argues that the leading social movements of the 1930s and the 1960s were 
products of social breakdown: for the former, the dislocations produced by the 
Great Depression; for the latter, the uprooting migration and modernization 
that followed World War II. Along the same lines, Jack Goldstone (1980)
found that whether a protest group was active during a period of social crisis
or breakdown is a better predictor of its success than its organizational
strength or tactics. Finally, Useem and colleagues link breakdown in
institutional effectiveness to the spread of the 1992 riot in Los Angeles
(Useem 1997) and to the occurrence of US prison riots (Useem 1985, Useem &
Kimball 1989, Useem et al 1996). Here I highlight several additional recent contributions.


Spiral of Ethnic Conflict 


Two recent studies of ethnic collective violence come to similar conclusions, 
which in turn are broadly consistent with a breakdown model. Psychoanalyst 
Sudhir Kakar (1996a) examined collective violence between Muslims and
Hindus in the Indian city of Hyderabad. Sociologist Anthony Oberschall
(1997) analyzed the collective violence that followed the breakup of the
Yugoslav state. In both studies, the explanandum is not ethnic conflict per se, but its
form of unrestrained violence. In Hyderabad, the Hindu-Muslim riots took 
anything but a ritualized form. Rather, they were infused with deadly intent, 
accompanied by the destruction of temples, mosques, and shrines and the 
burning of houses. The vocabulary, on both sides, was victory with honor and 
defeat with humiliation and deep emotional wounds. In the former Yugoslavia, 
the conflict entailed the expulsion of a quarter of the population from their 
homelands ("ethnic cleansing"), torture, mass executions and rape, and the
sniper-killings of civilians. 

Both Kakar and Oberschall argue that extreme collective violence arises
out of a spiral of conflict, polarization, and ultimately bloody, mutual
recriminations. In both case studies, breakdown elements permitted the spiral
to begin and to continue.

In Oberschall's account, the fall of the Yugoslav state was followed by a 
disintegration of the other institutions, including the economy. In the void,
extremist leaders formed militias, targeted moderates for refusing to go along
with them, and recruited members from large pools of unskilled men. As the 
economy further deteriorated with the start of the war, a criminal economy 
took its place, fueled by widespread looting, ransom of captured civilians, and 
seizure of property from those expelled from their homes and homeland. 

On a theoretical level, Oberschall’s analysis is instructive for two reasons. 
First, classic breakdown theory pictures collective violence as a by-product of 
impersonal, structural change. But Oberschall shows that, in this case, human 
agents were causing the breakdown, decisively and purposefully. Breakdown 
of the social order was part of the strategy of conflict escalation, which then 
fed on itself. 

Second, Oberschall’s analysis suggests that the distinction between break- 
down and solidarity is too simplistic. Much depends on the kinds of groups that 
are breaking down and the kinds that are forming. The place, timing, and mood 
of these processes are also important. Francis Fukuyama (1996, p. 15) makes a 
relevant point: “It is as if there is a natural, universal human impulse toward 
sociability, which, if blocked from expressing itself through social structures 
like the family or voluntary organizations, appears in the forms like criminal 
gangs." In Bosnia, social decomposition was a precursor to the solidaristic 
groups that came to dominate the situation. These groups, moreover, appear to 
have been closer to Fukuyama's "criminal gangs" than the sorts of community 
groups and networks of solidarity that RM theorists have in mind.

In explaining Hindu-Muslim violence, Kakar (1996a,b) distinguishes 
community identification from communalism. The latter, as used in the
Indian context, refers to a dominating sense of community identification, in
which a sense of "we-ness" is replaced by "we are." This exclusive attachment
to one's community is accompanied by hostility toward those communities 
that share a political and geographic space. Riots originate not only in the 
minds of men and women, according to Kakar, but early in their childhoods. 
Drawing on the work of Erik Erikson, Kakar finds that communalism
becomes imprinted during the same developmental stages in which a child
acquires a sense of self.

For ethnic/religious conflict within limits to turn to murderous ethnic
conflict without limits, two shifts have to occur. At the cognitive level, the
communal identity must take on overwhelming salience in a large number of
people at the same time. Social identity comes to dominate if not to displace
personal identity. At the affective level, love for one's group and hatred toward
out-groups must be rekindled from feelings first developed in childhood.
Drawing on psychoanalytic theory, Kakar (1996a, p. 43) refers to the
preoccupation with minor differences, which cause greater hostility than do wide
differences, as a key source of hateful representations of the other side. In 
Hyderabad, an initial attack by one side triggered fears of group annihilation in 
the other. Fueled by rumor and stoked by religious extremists, these fears gen- 
erated a spiral of attacks and counterattacks. 

Kakar's analysis is particularly instructive because he contrasts it with an 
explanation more consistent with the resource mobilization position: The 
Hindu-Muslim riots flowed out of group struggle over territorial control and 
political power (1996a, pp. 21-23, 194-196). The problem with this explanation
is that it cannot account for the extraordinary nature of the riots: people,
often neighbors, killing and maiming one another, rarely experiencing shame 
or guilt for the violence inflicted. 


Disruption of Quotidian 


David Snow and colleagues (1998) have recently advanced a version of the 
breakdown model that incorporates two other sets of theoretical insights into 
the model: prospect theory, as developed by Kahneman & Tversky (1979), and 
cultural theory, as developed by Bourdieu (1975), Schutz (1962), and Snow & 
Benford (1992). Prospect theory maintains that people frame their decisions in 
terms of gains or losses from their status quo or zero point, and losses loom 
larger than corresponding gains. The relevant element of cultural theory is the 
concept of “quotidian,” the taken-for-granted attitude of everyday life and ha- 
bituated routines. The quotidian normally keeps life stable and operating on an 
even keel. Examples of the quotidian include middle-class people going to 
work and making mortgage payments and homeless people making do with a 
meager but steady supply of provisions. 

The synthesis of prospect and cultural theories is this. Social breakdown
both (a) generates losses which, in turn, are experienced as highly salient
deprivations (prospect theory), and (b) undercuts actors’ confidence that their
accustomed routines can continue to provide a satisfying future (cultural theory).
It is this conjuncture of suddenly imposed deprivations and an uncertain future 
that gives rise to anger, indignation, and revolt. Snow and colleagues flesh out
(and to a degree, test) their position with several case studies, including work 
on homeless mobilization in eight cities, the mobilization of citizens following 
the 1979 accident at the Three Mile Island nuclear power plant, peasant
rebellions, and a prison riot.

In sum, even if one were to agree that breakdown theory has been an
intellectual “straightjacket” (Gamson 1990, p. 130), this impressive paper by Snow
and colleagues demonstrates that this need not be the case. It shows that break- 
down theory can be stretched in different directions, here to incorporate recent 
advances in phenomenology and cognitive psychology. The key argument, that
the collapse of everyday routines changes the cognitive and affective content
of actors’ minds, setting the stage for rebellion, seems plausible, if
unproven. This refreshing analysis should open up new avenues of research.


GOVERNMENTAL “REPRESSION” 


The reader might suspect that the wedge between breakdown and resource mo- 
bilization theories runs deeper than the causes of collective action per se. One 
issue concerns the properties of governmental force against protestors and riot- 
ers. A second issue, contrasting moral sentiments, is discussed in the next sec- 
tion. 

As is often the case, Tilly provides the clearest explication of the logic of
the RM position. He challenges the breakdown position that there exists a
meaningful distinction between "legitimate" and "illegitimate" force.

The very same acts, indeed, switch from illegitimate to legitimate if a
constituted authority performs them. Killing appears in both columns, but with
very different values. The values depend on whether the killer is a soldier, a
policeman, an executioner, or a private person. (1984, p. 56)


Tilly adds that, as a practical matter, he would call the police if someone stole 
his wallet. Still, he believes that the distinction between legitimate and illegiti- 
mate force "should never have entered the world of systematic explanation" 
(1984, p. 56). Finally, Tilly seems to discount the possibility that riots are 
genuinely frightening to the public. The term “riot” itself, Tilly argues, is 
merely a “legal device" that authorities use to justify the use of force against
assembled citizens. 

These arguments, in turn, permit Tilly to define “governmental repression”
as any governmental action that raises the “costs” of collective action. As 
noted earlier, Tilly used this definition of repression in his work on European 
collective action, and it has been adapted widely by others in the resource mo- 
bilization tradition (Opp & Roehl 1990, Tarrow 1994, p. 92-93, McAdam 
1982, p. 218—229). RM analysts now routinely refer to the “repression works" 
hypothesis.

Breakdown theorists believe that the distinction between legitimate and
illegitimate force is valid even, indeed especially, in regard to governmental use
of force. In this view, "repression" is reserved for governmental action that 
violates legal rights and/or political norms. The supposition is the existence of 
a set of stable political standards (a) against which public officials are held
accountable, (b) from which departures are exceptional rather than routine, and
(c) that permit citizens to know, with some certitude, the circumstances in 
which the government will use force against collective action (Allen 1996, 
Hayek 1944). At the extremes, cases are easily distinguished: police shooting 
peaceful protestors (repression) versus police arresting looters or even using 
deadly force when needed to protect an innocent third party from a murderous 
assault (lawful exercise of state authority). 

Furthermore, breakdown theorists would insist that government action 
against rebellion may involve more than an attempt to "raise its costs." Funda- 
mental to virtually all legal systems—modern and premodern, East and 
West—is a distinction between criminal and civil law (Cooter 1984, Robinson 
1996). Civil law "prices" or raises the costs of certain behavior, either to dis- 
suade it, provide a remedy to an injured party, or facilitate an equitable distri- 
bution of losses. The priced behavior may warrant reproval, but its wrongful- 
ness does not rise to the level of a criminal act. Criminal law, in contrast, en- 
tails more than raising the costs of certain behavior. It attempts to convey soci- 
ety's moral condemnation for the prohibited behavior, sometimes in the most 
unequivocal and dramatic manner possible: imprisonment or even death. 

RM theorists use the vocabulary of civil law, rather than criminal law, as if 
to assume away the possibility that government action against rioters/protes- 
tors represents a moral consensus within a community. Breakdown theorists 
would look for variation across situations. The concept of a general state of 
fear and the rights of potential riot victims provides a normative basis for gov- 
ernment action against rioters. From the point of view of breakdown theory, 
"repression" cannot be measured by counting the number of protestors ar- 
rested and shot by police. One must determine, as well, the existence of a moral 
consensus and/or legal justification behind those arrests and shootings. While 
this makes governmental repression hard to isolate and measure, it also argues 
against supposing that all governmental action against rebellion is repression.

These arguments are open to empirical testing. To illustrate, one sort of evi- 
dence would be effects of riots on handgun ownership. By way of background, 
criminologists have found that handgun purchases and crime rates covary over 
time. While in principle the causal direction could go either way, the bulk of 
evidence supports the “fear and loathing” hypothesis: when crime rates rise, 
people buy handguns out of fear (McDowall 1995). Thus, if riots have the ef- 
fect of generating handgun purchases, then it could be reasonably inferred that 
they strike fear into the public. 

The evidence supports this argument. David McDowall and Colin Loftin 
(1983) found that requests for handgun permits rose dramatically after major 
riots in Detroit in the mid-1960s. Charles Clotfelter (1981) found the same ef- 
fect in the six states that he studied. By Clotfelter's calculation, a 50% increase 
in civil disorders increased handgun purchases by 5%. 
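
As a rough reading of Clotfelter's figure, and assuming the two percentages can be treated as a simple ratio of proportional changes (an assumption made here for illustration, not a calculation reported in the source), the implied elasticity of handgun purchases with respect to civil disorders is about 0.1:

```latex
\[
\varepsilon \;\approx\;
\frac{\%\,\Delta\ \text{handgun purchases}}{\%\,\Delta\ \text{civil disorders}}
\;=\; \frac{5\%}{50\%} \;=\; 0.1
\]
```

On this reading the response is positive but inelastic: purchases rise with disorders, though far less than proportionally.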

Figure 1 extends these analyses. It plots handgun production for domestic
sales, homicide rates, and a measure of total riot activity per annum in the
United States between 1964 and 1994.¹

A visual inspection of the graph suggests, first, that homicide rates and 
handgun rates track together closely, per the fear-and-loathing hypothesis. 
Second, in certain years, handgun production jumps above what would be
anticipated from homicide rates alone; these years, in turn, seem to correspond to
periods in which there are upsurges in rioting. 

Both observations are confirmed by regression analyses, in which riot in- 
tensity and homicide rates in one year were used to predict handgun produc- 
tion in the subsequent year. Riot intensity per annum was recoded to a dummy 
variable: Each year was coded as one of, or not one of, the top fifth of the riot- 
intense years. Results from standard OLS regression indicated (via the Durbin- 
Watson statistic) significant autocorrelation. Further investigation suggested 
that the error structure could be characterized by a first-order autoregressive 
process. In a subsequent regression both coefficients were statistically
significant (p < .001) and in the predicted direction.
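
The steps in this paragraph can be sketched in code. The sketch below is only an illustration under stated assumptions: the text does not say which estimator was used for the first-order autoregressive correction, so the Cochrane-Orcutt procedure is swapped in here as one common choice, and all function names (`lagged_design`, `ols`, `durbin_watson`, `cochrane_orcutt`) are hypothetical. The Durbin-Watson statistic is the usual ratio of summed squared differences of successive residuals to the summed squared residuals; values near 2 suggest no first-order autocorrelation.

```python
def lagged_design(riot_dummy, homicide, handguns):
    """Predict handgun production in year t+1 from year-t riot dummy and homicide rate."""
    X = [[1.0, riot_dummy[t], homicide[t]] for t in range(len(handguns) - 1)]
    y = handguns[1:]
    return X, y


def ols(X, y):
    """Least-squares coefficients from the normal equations (X'X) b = X'y."""
    k = len(X[0])
    # Augmented matrix [X'X | X'y]
    A = [[sum(r[i] * r[j] for r in X) for j in range(k)]
         + [sum(r[i] * yi for r, yi in zip(X, y))] for i in range(k)]
    for c in range(k):  # Gauss-Jordan elimination with partial pivoting
        p = max(range(c, k), key=lambda r: abs(A[r][c]))
        A[c], A[p] = A[p], A[c]
        piv = A[c][c]
        A[c] = [v / piv for v in A[c]]
        for r in range(k):
            if r != c:
                f = A[r][c]
                A[r] = [vr - f * vc for vr, vc in zip(A[r], A[c])]
    return [A[i][k] for i in range(k)]


def residuals(X, y, b):
    return [yi - sum(bj * xj for bj, xj in zip(b, row)) for row, yi in zip(X, y)]


def durbin_watson(e):
    """d = sum((e_t - e_{t-1})^2) / sum(e_t^2)."""
    num = sum((e[t] - e[t - 1]) ** 2 for t in range(1, len(e)))
    return num / sum(v * v for v in e)


def cochrane_orcutt(X, y, iterations=10):
    """Iterated AR(1) correction (assumed method; not named in the source)."""
    n, k = len(X), len(X[0])
    b, rho = ols(X, y), 0.0
    for _ in range(iterations):
        e = residuals(X, y, b)  # residuals of the untransformed equation
        den = sum(v * v for v in e[:-1])
        if den < 1e-12:  # residuals essentially zero: nothing to correct
            return b, 0.0
        rho = sum(e[t] * e[t - 1] for t in range(1, n)) / den
        rho = max(-0.99, min(0.99, rho))  # keep the transformed intercept column nonzero
        # Quasi-difference every column, including the constant
        Xs = [[X[t][j] - rho * X[t - 1][j] for j in range(k)] for t in range(1, n)]
        ys = [y[t] - rho * y[t - 1] for t in range(1, n)]
        b = ols(Xs, ys)
    return b, rho
```

In use, one would build `X, y = lagged_design(...)` from the three annual series, inspect `durbin_watson(residuals(X, y, ols(X, y)))`, and, if it departs markedly from 2, re-estimate with `cochrane_orcutt(X, y)`. Because the constant column is quasi-differenced along with the predictors, the first returned coefficient remains the intercept of the original equation.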

In sum, both increased homicides and widespread rioting appear to strike 
fear into others at a level sufficient to generate gun purchases. This speaks to
the variable role that collective action plays in democratic societies, depending
on its form. Ronald Dworkin (1985, p. 104-116) has argued that civil
disobedience has earned a legitimate if informal role in the US political system.
Few now regret or condemn the Civil Rights movement and its most celebrated
uses of civil disobedience, for example. Yet if civil disobedience is no longer a
"frightening idea in the United States," as Dworkin (1985, p. 105) argues, the
evidence above suggests the same is not true for urban riots.


MORAL SENTIMENTS AND COLLECTIVE ACTION 


Breakdown and resource mobilization theorists also differ on moral features of
collective action. Useful in describing these differences is Peter Gay's (1993)
point that aggression implies “attack” which, in ordinary language, has two
meanings. Attack may refer to a setting upon another in a hostile way.
Examples include a criminal attack, an attack from ambush in warfare, or a scathing
attack by a literary critic. But attack can also refer to an adaptive mastery: A 
scientist may attack a puzzling problem of nature; an orchestra can attack a 
challenging composition; or a university may aggressively pursue a program 
of equal employment opportunity. In Gay's account, the aggressiveness of the
bourgeoisie of the nineteenth century exacted grievous costs, in the form of
abused natives, exploited laborers, and discarded artisans, but it also permitted
people to transform and shape their environment in ways unprecedented
(1993, p. 6). Attack can, as well, be counteraggression: One can fight back
against bullies, parry insults, and assert oneself against cutthroat competition.

Figure 1 Homicides, new handguns, and riot intensity, United States, 1964-1994.

¹The data sources for Figure 1 are described in Appendix A. Riot activity was measured by
transforming into z-scores the per annum number of arrests, injuries, deaths, days rioted and arsons,
and then summing those scores for each year. A year could have a negative intensity score if it fell
below the mean on one or more of these measures.

Few would challenge that collective action is aggression. The disagreement 
is over the kind of aggression, or attack, it is. Resource mobilization theorists 
are mainly creatures of the 1960s and 1970s and, as such, tend to emphasize ag- 
gression's positive dimensions. Collective action is ordinary people taking 
control over their lives, in this view. For example, Charles Tilly (1978, p. 2) 
opens From Mobilization to Revolution by describing an attempt by English 
villagers in 1765 to pull down a workhouse, and he is at pains to point out that 
in the sense of “frenzy, confusion, or wanton destruction" the event was “not a 
riot”; on the contrary, both sides “knew what they were doing" and “did as best 
they could." These events, Tilly makes clear, were constructive aggression. 
Breakdown theorists are more likely to see collective action as hostile ag- 
gression. They concede that collective action may secure immediate conces- 
sions and may even plant the seeds of a new social order. But they are far more 
inclined than RM theorists to worry that collective action has another side.
This concern permits James Q. Wilson (1993, p. 229—230) to refer to the 
events in Los Angeles in April 1992 not as a rebellion, but as the “terrible ri- 
ots" that “racked the city I love.” 

Indeed, only the most rigid and unreconstructed resource mobilization
theorists would fail to see hostile aggressiveness when, during both the 1980
Miami riot and 1992 Los Angeles riot, passing motorists were pulled from
their vehicles to be beaten, maimed, or even killed. Portes & Stepick (1993, p.
48-49) describe Miami’s “nightmarish” experience: “Whites were doused
with gasoline and set afire in their cars; others were dragged out and beaten re- 
peatedly with chunks of concrete and bricks, run over by cars, stabbed with 
screwdrivers, and shot. One was left dying in the street, a red rose in his 
mouth." 

But how is one to know if, in any particular instance of aggression, con- 
structive or destructive impulses predominate? One problem is that the answer 
will depend upon whether one relies on statements by the aggressor or the 
party whose ox is gored. During the April 1992 riots, Korean businesses were 
targeted for looting and arson (Ong & Hee 1993, Tierney 1994). From the
Korean merchants’ point of view, the riot was driven by destructive impulses
or, at best, by a mean-spirited revenge for perceived slights of honor and disrespect.
But what target of aggression would agree that the blows against them are war- 
ranted? 

Moreover, to continue with the example, what struck James Q. Wilson 
(1993, p. 230) most about the April 1992 riots was that the rioters felt an obli- 
gation to justify their behavior. From this Wilson makes an interesting obser- 
vation: Individuals inevitably offer a moral justification whenever their ac- 
tions, however destructive and purely self-interested, violate a moral principle. 
Thus, the fact that the Los Angeles rioters offered a moral justification for their 
action is not informative. 

Yet how does Wilson know that the rioters’ complaints of injustice and dis- 
crimination were disingenuous, that their explanations of their own behavior 
were mere rationalizations, and that the firebombs and looting were not war- 
ranted by abuses of the past? There were plenty of observers willing to accept 
these justifications, who insisted that the events were a rebellion and not a riot 
(e.g. Johnson et al 1992). 

The problem is that human complexity is not suspended when people go
onto the streets. Collective behavior is behavior. Certainly Kakar’s work on 
Hindu-Muslim violence suggests that surface discourse may imperfectly re- 
flect underlying concerns. What actuates aggressors may be obscured by the 
defensive stratagems that Peter Gay calls “alibis for aggression”: beliefs,
principles, and comforting bromides used to rationalize verbal or physical
militancy against others (1993, p. 1-8, 35-38). Perhaps alibis for aggression are
what Wilson heard expressed by the Los Angeles rioters, although I know of
no evidence that would prove this. The best we can do is carefully tease out the 
evidence, case by case. We are far from the last chapter on urban riots, far from 
even knowing what they are about. 


FUTURE DIRECTIONS 


To recapitulate, breakdown theorists argue that defiant or “nonroutine” forms 
of collective action occur when the mechanisms of social control falter or eve- 
ryday routines are disrupted. If this stance is accepted as plausible, future work 
should identify different ways in which breakdown processes may generate 
collective disorders. Possible formulations include these: 


Social Capital/Breakdown Theory 


Social capital theorists argue that economic prosperity requires strong net- 
works of cooperation among people and high levels of interpersonal trust asso- 
ciated with such networks (Coleman 1990, Putnam 1993, Fukuyama 1995). 
For students of collective behavior, the social capital perspective puts two dis- 
tinct causal arguments on the agenda. 

One concerns the connection, if any, between social capital and rebellion. Fu- 
kuyama (1995, p. 295—306; 1996) asserts that US urban riots, both in the 1960s 
and 1990s, were an outcome of low levels of social capital in the African- 
American community: a predominance of single-parent families and weak 
larger groups. This lack of cohesion is a legacy of slavery, Fukuyama argues. 

A second key question concerns the connection among state intervention, 
social capital, and rebellion. Here the analysis is complicated by the fact that 
social capital theorists are divided between those who argue that state inter- 
vention crowds out social capital and those who imply a synergistic relation- 
ship (Evans 1996). The former position is that state intervention, including 
law, tends to destroy social trust and norms of cooperation (Coleman 1990, 
Pildes 1996, Cover 1983). The latter position is that effective state intervention 
promotes civic engagement and reshapes norms in a positive direction and, 
conversely, that high levels of civic engagement nourish state effectiveness 
(Putnam 1993, Sunstein 1996, Posner 1996). 

While no resolution to this debate is in sight, adding rebellion to the equa- 
tion is intriguing. The crowd-out theorist would argue that state intervention 
might have short-run benefits, such as the amelioration of specific grievances, 
but long-run costs: a decline of productive cooperation, pervasive cynicism, 
and disaffection that presage urban crime and disturbances. The synergists 
would argue that state intervention, when practiced skillfully and strategically, 
would increase a population’s attachment to society. Collective action would 
take primarily “routine” forms rather than “nonroutine” forms.


Routine-Disorder/Breakdown Theory


James Q. Wilson (1968) observed that, for every person, there is a public space 
in which his or her sense of safety, propriety, and self-worth are either affirmed 
or jeopardized by the events and people encountered. Hence, Wilson goes on 
to argue, routine disorders—vandalism, public drinking, graffiti, and corner 
gangs—corrode community morale, stimulate people and commerce to move 
away, and cause crime. Wesley Skogan (1990) verified key elements of Wil- 
son's argument, based on data collected on crime, disorder, and residents' per- 
ceptions of disorder in 40 urban neighborhoods. He shows, first, that urban 
residents, regardless of racial, ethnic, or economic background, agree about 
what constitutes order and how much there is in their neighborhood; second, 
disorder both spawns serious crime and plays a key role in neighborhood de- 
cline. Subsequent researchers have also verified the routine-disorders argu- 
ment as well as amplified theoretical aspects of it (Bursik & Grasmick 1993, 
Kelling & Coles 1996, Sampson 1995).

Wilson's insight and its verification suggest the possibility that routine
disorders may also contribute to nonroutine collective action. For example, one
would like to see if Skogan's objective and subjective indices of disorder could 
also be used to predict the occurrence of US urban riots. If this were shown to 
be true, the findings would fit nicely with the breakdown position.


Clash of Civilizations/Breakdown Theory 


Samuel Huntington (1996a, 1996b) has argued that, with the end of the Cold 
War, the world is becoming increasingly divided along the lines of culture and 
religion rather than economics or ideology. Accordingly, the clash between the 
world's seven or eight major civilizations will be the basis of the most hostile 
conflicts, which pose the greatest risk of escalating into collective violence. 
Huntington goes on to predict that non-Western countries formerly united by 
ideology or historical circumstance but divided by civilization will tend to fall 
apart with a high potential for collective violence. In the Western context, 
countries that fall prey to multiculturalism and uncontrolled immigration will 
experience a disuniting of society, followed by decay and disintegration. 
Huntington's argument fits squarely within the breakdown tradition: Nonrou- 
tine collective action erupts when a mechanism of integration—in this formu- 
lation, a cultural core—is weakened. 
In sum, social-capital, routine-disorder, and clash-of-civilizations perspec- 
tives may each serve as a launching pad to rethink the sources and dynamics of 
collective action. To the extent that these routes are taken, they will require us 
to relax the assumption that the nation state is the unit around which collective 
action revolves in all cases, to distinguish more clearly the causes of routine 
and nonroutine collective action, and to look to cultural differences as sources
of conflict. 


CONCLUSION 


Rebellion would be easier to understand, our theories of it more parsimonious 
and powerful, if it flowed out of one condition and toward one purpose: power 
politics. To be sure, much collective action does involve political contention 
and constitutes politics by other means. But to breakdown theorists, this does 
not exhaust the range of possibilities. 

Ethnic and racial groups may attack one another in a spiral of hatred and re- 
venge. From time to time, collective action appears to serve as the vehicle 
through which the dispossessed and those detached from and unconcerned 
about the welfare of society express their hostility, vent their rage, or secure 
short-term material advantages. While these goals are not irrational (nothing is 
more individually rational than crime), neither are they politics by other means. 

The breakdown position is not an “intellectual weapon" concocted to “dis- 
credit mass movements of which one is critical," as Gamson (1990, p. 133) al- 
leges; nor is it cynicism. Rather, it is a recognition of the complexity and diver- 
sity of social life. To argue that all collective action is part and parcel of politi- 
cal struggle is to exaggerate the centrality of power and impute an ideology of 
social change where none may exist. 

In my view, efforts to replace breakdown theory with resource mobilization 
theory are ill-founded. Both logic and evidence seem to suggest that the break- 
down and RM theories explain different kinds of collective action. Each ap- 
proach deserves recognition. Also, much of interest appears to occur in a mid- 
dle ground, in some sort of amalgam between breakdown and RM processes. 

In sum, breakdown theory was developed to explain collective action that 
involves a basic rupture of the social order. Breakdown processes do not de- 
stroy a community root and branch, producing a mass of atomized individuals, 
but they can cripple a community's ability to perform key functions. Such
ruptures are rare. But they do happen.


Appendix A


The data on riot intensity are from three sources: a data set provided by Gregg 
Lee Carter, covering the period 1964 to 1971; three major indexed papers (New 
York Times, Washington Post, Chicago Tribune), for the period 1972 to 1985;
and the Westlaw Inc. newspaper database, for the period 1986 to 1994. Carter 
collected his data from five key sources, including published and unpublished 
data from the Lemberg Center for the Study of Violence at Brandeis University, 
the New York Times and Washington Post, and compilations made by the staff of 
a US Senate committee (for a detailed discussion, see Carter, 1983, ch. 2). The
Westlaw database permits a key-word computer search of 150 US newspapers. 
In general, it would be preferable to use a single data source for the entire 
period. The problem is that the only data source that covers the entire period is 
the major indexed newspapers. A decision not to use the Carter and Westlaw 
data would have resulted in a significant loss of information for the periods 
they cover. Balancing the tradeoffs, I decided to make use of the data from 
Carter and Westlaw. Carter (1983, p. 75—78) argues, with supporting evidence, 
that a riot severity index is best constructed using five indices (arrests, injuries, 
deaths, arsons, and days rioted). I accepted this argument. The five indices for 
each riot were transformed to Z-scores, which were then summed for each 
year. The data on homicides are from the Bureau of Justice Statistics (1996). 
The handgun data are from annual reports of the Bureau of Alcohol, Tobacco 
and Firearms (1991, 1995). 
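
The index construction described above, together with the top-fifth dummy coding used in the regression, can be sketched as follows. This is a minimal illustration, not the author's procedure: it follows the footnote's per-annum description (yearly totals standardized across years, then summed), assumes the population standard deviation (the text does not say which was used), and uses hypothetical names (`riot_intensity`, `top_fifth_dummy`, the `MEASURES` keys) and a dictionary layout chosen only for the example.

```python
from statistics import mean, pstdev

# The five Carter (1983) components named in the appendix; key names are illustrative.
MEASURES = ("arrests", "injuries", "deaths", "arsons", "days_rioted")


def riot_intensity(yearly):
    """Sum of z-scores across the five measures for each year.

    yearly: dict mapping year -> dict mapping measure name -> annual total.
    """
    years = sorted(yearly)
    scores = {year: 0.0 for year in years}
    for m in MEASURES:
        values = [yearly[year][m] for year in years]
        mu, sd = mean(values), pstdev(values)  # population SD is an assumption
        for year in years:
            # A measure with no variation contributes nothing to the index
            scores[year] += (yearly[year][m] - mu) / sd if sd > 0 else 0.0
    return scores


def top_fifth_dummy(scores):
    """Code each year 1 if it is among the top fifth of riot-intense years, else 0."""
    ranked = sorted(scores, key=scores.get, reverse=True)
    cutoff = max(1, len(ranked) // 5)
    top = set(ranked[:cutoff])
    return {year: int(year in top) for year in scores}
```

Because each component is centered on its own mean, the yearly scores sum to zero across the period, which is why a year can carry a negative intensity score when it falls below the mean on the measures.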
ABSTRACT


Since the late 1970s, theorizing in psychological social psychology has been 
dominated by the computer metaphor of information processing models, 
which fostered an emphasis on “cold” cognition and the conceptualization of 
individuals as isolated information processors. More recent research shows a 
renewed interest in the interplay of feeling and thinking in social judgment 
and in the role of unconscious processes in reasoning and behavior. More- 
over, research into socially situated cognition and the interplay of communi- 
cation and cognition highlights the role of conversational norms, social inter- 
dependence, and power in social judgment. Experimental research into these 
issues is reviewed. The emerging picture is compatible with social psychol- 
ogy’s latest metaphor, humans as motivated tacticians who pragmatically 
adapt their reasoning strategies to the requirements at hand. 


INTRODUCTION 


Independent of their theoretical orientations, social psychologists agree on two 
basic tenets (Taylor 1997, p. 3): 


The first is that individual behavior is strongly influenced by the
environment, especially the social environment. The person does not function in an
individualistic vacuum, but in a social context that influences thought,
feeling, and action. The second point of consensus is that the individual actively
construes social situations. We do not respond to environments as they are, 
but as we interpret them to be.

For the past two decades, psychological social psychologists have largely
focused on the latter tenet, emphasizing inside-the-head phenomena, and have 
left the former tenet to sociological social psychologists. As Markus & Za- 
jonc (1985) observed, [psychological] “social psychology and cognitive so- 
cial psychology are today nearly synonymous. The cognitive approach is 
now clearly the dominant approach among social psychologists, having virtu- 
ally no competitors" (1985, p. 137). Moreover, the nature of cognitive theoriz- 
ing shifted from an emphasis on “warm” cognition and motivated reasoning to 
an emphasis on "cold" cognition. Consequently, memory processes, logical 
inference, and cognitive biases became key topics of interest in cognitive so- 
cial psychology. This shift reflected the advent of information processing 
models in cognitive psychology in the late 1960s (Neisser 1967; for an intro- 
duction see Lachman et al 1979) and their rapid adoption by social psycholo- 
gists in the 1970s, documented in a seminal volume on "person memory" 
(Hastie et al 1980). The resulting theoretical approach, known as social
cognition, has been characterized as social psychology under the paradigm of
information processing (Strack 1988). It emphasized information encoding,
storage, and retrieval and drew heavily on computer metaphors (see Wyer & Srull
1989). 

On the positive side, the information processing approach has rapidly ad- 
vanced our understanding of the cognitive processes underlying many social 
phenomena (for reviews, see Devine et al 1994). Moreover, its emphasis on 
detailed process models has changed the field's standards for what counts as 
appropriate evidence: "It was no longer enough to detail a theoretical model 
and use the results as confirmation of the model. If one stated what one 
thought the process was, one had to demonstrate the intervening steps" (Taylor 
1998, p. 74). To obtain relevant “process data,” social psychologists now in- 
clude response latencies, recall and recognition tests, think-aloud protocols, 
and similar measures in their experiments (for reviews, see the contributions 
in Schwarz & Sudman 1996), and they increasingly employ mediational 
analyses (Baron & Kenny 1986) to test the theoretically specified process as- 
sumptions. 

Yet the adoption of the information processing paradigm was not without 
its drawbacks. Most importantly, the computer metaphor, around which mod- 
els of information processing are built, constrained the range of phenomena 
addressed by cognitive social psychologists in several ways. First, this meta- 
phor does not easily lend itself to investigations of emotional and motivational 
influences on human cognition and behavior and hence fostered a neglect of 
“warm” cognition in favor of an emphasis on “cold” cognition. Consequently, 
the complex treatment of perceivers’ goals and needs that characterized the 
New Look (e.g. Bruner 1957) was reduced to simpler processing goals, such as 
forming an accurate impression versus remembering the material presented in 
the experiment (e.g. Hamilton et al 1980; for a critical discussion, see Hilton &
Darley 1991). Similarly, the role of affective states in cognition and behavior 
was largely ignored (Zajonc 1980). Second, reliance on the computer meta- 
phor as a guiding framework also fostered an exclusive concentration on indi- 
viduals as isolated information processors. This focus resulted in a neglect of 
the social context in which humans do much of their thinking, prompting 
Schneider (1991, p. 553) to wonder, “Where, oh where, is the social in social 
cognition?" 

Finally, investigations of overt social behavior became a rare event. As
Forgas (1981, p. 3) observed early on, “social psychology found itself transformed
into a field now mainly concerned not with human social action, but with hu- 
man beings as thinkers and information processors about social stimuli." And 
whatever these "thinkers and information processors" thought about, they 
seemed likely to get it wrong, as a long list of biases and shortcomings in social 
judgment illustrated (Nisbett & Ross 1980, Ross 1977). 

More recently, however, the field has moved beyond these constraints. 
Cognitive social psychologists have turned to explorations of the role of 
moods, emotions, goals, and motivations in human reasoning, with “warm” 
cognition receiving considerable attention, as illustrated in the three volumes 
of the Handbook of Motivation and Cognition (Sorrentino & Higgins 1986, 
1995; Higgins & Sorrentino 1990). Moreover, they have rediscovered that hu- 
mans do much of their thinking in a social context and have turned to the ex- 
ploration of socially situated cognition (for a review, see Levine et al 1993) 
and the interplay of cognition and communication in human reasoning (for re- 
views, see Hilton 1990, 1991, 1995; Krauss & Fussell 1996; Schwarz 1996). 
Complementing these developments, they reconsidered the fallibility of hu- 
man judgment from a pragmatic perspective, emphasizing William James's 
(1890, p. 333) credo that “my thinking is first and last and always for the sake 
of my doing.” From this perspective, many violations of normative models
seem less detrimental than earlier discussions implied, leading some observers
to conclude that “people are good enough perceivers” (Fiske 1992). Although
human social action is still not a common topic in cognitive social psychology, 
its cognitive and motivational underpinnings have received renewed attention 
(see the contributions in Gollwitzer & Bargh 1996). 

I review these developments by highlighting representative lines of research.
The first section of the review illustrates the renewed interest in “warm”
cognition by addressing one theme of this development, namely the interplay of
feeling and thinking in social judgment, stereotyping, and persuasion (for a
broader coverage of “warm” cognition, see the contributions in Sorrentino &
Higgins 1986, Higgins & Sorrentino 1990). The second section addresses socially
situated cognition and emphasizes the influence of the immediate social context
on individuals' judgment strategies. Research in this domain has highlighted
the role of social interdependence, power, and accountability and has drawn
attention to the influence of conversational processes on human reasoning.

The picture that emerges from this work is consistent with cognitive social 
psychology's latest metaphor, the person as a motivated tactician (Fiske & 
Taylor 1991). "The motivated tactician is viewed as having multiple informa- 
tion processing strategies available, selecting among them on the basis of 
goals, motives, needs, and forces in the environment. As such, the motivated 
tactician exemplifies the pragmatic tradition in social psychology" (Taylor 
1998, p. 75). As becomes apparent in this review, the choice of processing 
strategies is highly sensitive to a wide range of relevant information, provided 
by the immediate social context, internal states, and long-term goals. The mo- 
tivated tactician metaphor is compatible with a growing number of dual- 
process models that distinguish between heuristic and systematic processing 
strategies in person perception (e.g. Brewer 1988, Fiske & Neuberg 1990), 
persuasion (e.g. Bohner et al 1995, Eagly & Chaiken 1993, Petty & Cacioppo 
1986), and decision making (e.g. Payne et al 1993) and identify the conditions 
under which different strategies are likely to be employed (for reviews, see the 
contributions in Chaiken & Trope 1998). At present, this metaphor promises to 
replace its predecessors—which captured earlier research themes by portray- 
ing persons either as consistency seekers, wishful thinkers, lay scientists, or 
cognitive misers—by suggesting that all of them apply under some specific 
conditions. Yet, the motivated tactician's ability to flexibly adjust his or her 
cognitive processes to situational requirements is not without limits, as an in- 
creasing body of research into unconscious processes and the limits of mental 
control indicates. Selected aspects of this research are addressed in the final
section of this review. 


THE INTERPLAY OF FEELING AND THINKING 


Cognitive social psychologists’ interest in the interplay of feeling and thinking 
has mostly focused on global happy and sad moods rather than on specific 
emotions, like anger or fear. Whereas emotions have a specific referent (i.e. we
are angry "about something") and draw attention to the eliciting event, moods 
lack a specific referent (i.e. we are “in” a happy mood) and are of a more dif- 
fuse and less intense nature, usually not capturing individuals' attention (see 
Clore et al 1994, Morris 1989). As an important consequence, moods may 
function in the background of other activities, influencing a wide range of
cognitive processes and overt behaviors. Of particular interest has been their
influence on memory, judgment, and the choice of information processing
strategies.


Memory and Judgment


Numerous studies demonstrated that people evaluate nearly everything— 
ranging from consumer goods and the state of the economy to the quality of 
their lives—more positively when in a good than in a bad mood (for reviews, 
see Clore et al 1994, Forgas 1995, Schwarz & Clore 1996). Moreover, these in- 
fluences translate into overt behavior (for a review, see Isen 1984), including 
weather-induced mood effects on the stock market (Saunders 1993). In most 
studies, moods have been experimentally induced, either by minor events (e.g. 
finding a dime or receiving a cookie), exposure to valenced material (e.g. 
watching a sad video or recalling a happy event from one's past), or natural cir- 
cumstances (e.g. sunny or rainy weather), with similar results across different 
manipulations. The impact of moods on judgment has been traced to two dif- 
ferent processes, namely differential recall of material from memory and the 
use of one's feelings as a source of information. 


MEMORY Individuals are more likely to recall positive material from mem- 
ory when they are in a happy rather than sad mood. Following initial sugges- 
tions by Isen et al (1978), Bower (1981, 1991) conceptualized these effects in 
an associative network model of memory. Moods are thought to function as 
central nodes in an associative network, which are linked to related ideas, 
events of corresponding valence, autonomic activity, and muscular and expressive
patterns. When new material is learned, it is associated with the nodes
that are active at learning. Accordingly, material acquired while in a particular 
mood is linked to the respective mood node. When the person is in the same 
mood later on, activation spreads from the mood node along the pathways, in- 
creasing the activation of other nodes, which represent the related material. 
When the activation exceeds a certain threshold, the represented material 
comes into consciousness. This model makes two key predictions: First, mem- 
ory is enhanced when the affective state at the time of encoding matches the af- 
fective state at the time of retrieval (state-dependent learning). Thus, we are 
more likely to recall material acquired in a particular mood when we are in the 
same, rather than a different, mood at the time of recall. Second, any given ma- 
terial is more likely to be retrieved when its affective tone matches the indi- 
vidual's mood at the time of recall (mood-congruent memory). Thus, informa- 
tion of a positive valence is more likely to come to mind when we are in a 
happy rather than sad mood. 
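
The spreading-activation logic described above can be illustrated with a toy sketch. It is not Bower's (1981) formal model: the node names, the decay factor, and the recall threshold are hypothetical values chosen only to make the two predictions visible.

```python
# Toy sketch of an associative network with spreading activation.
# Node names, decay, and threshold are illustrative assumptions,
# not parameters taken from Bower (1981).

def learn(links, active_mood, item):
    """Associate newly learned material with the mood node active at learning."""
    links.setdefault(active_mood, []).append(item)


def recall(links, current_mood, decay=0.5, threshold=0.2):
    """Spread activation outward from the current mood node.

    Each link passes on a fixed fraction (decay) of the sender's activation.
    Material whose activation reaches the threshold counts as recalled.
    """
    activation = {current_mood: 1.0}
    frontier = [current_mood]
    while frontier:
        next_frontier = []
        for node in frontier:
            passed_on = activation[node] * decay
            for neighbor in links.get(node, []):
                # Only update a node if this path activates it more strongly.
                if passed_on > activation.get(neighbor, 0.0):
                    activation[neighbor] = passed_on
                    next_frontier.append(neighbor)
        frontier = next_frontier
    return {
        node
        for node, level in activation.items()
        if level >= threshold and node != current_mood
    }


# Mood-congruent memory: mood nodes are linked to events of matching valence.
links = {
    "happy": ["praise from a friend"],
    "sad": ["failed exam"],
}

# State-dependent learning: neutral material acquired while happy
# becomes linked to the "happy" node.
learn(links, "happy", "word list A")

recalled_when_happy = recall(links, "happy")
recalled_when_sad = recall(links, "sad")
```

In this sketch the neutral word list is recalled only in a happy mood (state-dependent learning), and positive events come to mind only in a happy mood (mood-congruent memory).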

Although both predictions received considerable empirical support in ex- 
perimental and clinical research, this research also revealed a number of com- 
plications that are beyond the scope of this chapter (see Blaney 1986, Clore et 
al 1994, Morris 1989, Singer & Salovey 1988). In general, mood-congruent recall
is most likely to be obtained for self-referenced material, such as
autobiographical events, that meets the conditions of both of the above hypotheses:
When something good (bad) happens to us, it puts us in a positive (negative)
affective state, and its subsequent recall is facilitated when we are again in a
similar affective state. Note that this situation simultaneously provides for 
matching mood states at learning and recall, thus satisfying the conditions of 
state-dependent learning, as well as for matches between the valence of the 
material and the mood at recall, thus satisfying the conditions of mood- 
congruent memory. 

From this perspective, mood effects on judgment are mediated by mood- 
congruent recall from memory: When asked how satisfied we are with our life, 
for example, we recall relevant information from memory. However, positive 
(negative) material is more likely to come to mind when we are in a happy 
(sad) mood, resulting in a mood-congruent sample of relevant information and 
hence a mood-congruent judgment. 


FEELINGS AS INFORMATION As an alternative approach, Schwarz & Clore 
(1983, 1996) suggested that our feelings themselves may serve as a source of 
information in making a judgment. Specifically, individuals may simplify the 
judgmental task by asking themselves, “How do I feel about it?” Some evalua- 
tive judgments refer, by definition, to one’s affective reaction to the target (e.g. 
feelings of liking), and one’s current feelings may indeed be elicited by the tar- 
get. However, due to the undifferentiated and unfocused nature of moods, it is 
often difficult to distinguish between one's affective reaction to the object of 
judgment and one's preexisting mood state. Accordingly, individuals may 
misread their preexisting feelings as a reaction to the target, resulting in more 
positive evaluations under happy rather than sad moods. 

If so, mood effects should be eliminated when the informational value of 
one's current feelings for the judgment at hand is called into question. Empiri- 
cally, this is the case. For example, respondents reported lower life-satis- 
faction in telephone interviews when called on rainy rather than sunny days, 
reflecting the impact of the weather on their current mood. This effect was 
eliminated, however, when the interviewer inquired about the weather, thus 
drawing respondents' attention to this extraneous source of their mood 
(Schwarz & Clore 1983). Such a discounting effect would not be expected if 
the impact of mood were mediated by mood-congruent recall: Attributing 
one's sad mood to the rainy weather only discredits the informational value of 
the sad mood as a reflection of one's life in general; it does not discredit the im- 
plications of actually experienced negative life events that may come to mind. 

These and related findings indicate that moods, emotions (e.g. Keltner et al 
1993), bodily states (e.g. Stepper & Strack 1993, Zillman 1978), and phenomenal
experiences (e.g. Clore 1992) may themselves serve as a source of information
that individuals draw on according to a "How-do-I-feel-about-it?"
heuristic (for reviews, see Schwarz & Clore 1996). Accordingly, feelings may
influence judgments either directly, by serving as a source of information, or 
indirectly, by influencing what comes to mind. Consequently, the motivated 
tactician's judgments are likely to be tainted by current moods independent of 
whether the person engages in heuristic judgment strategies or engages in 
more elaborate recall of relevant information from memory (Forgas 1995). 
Complementing this focus on cognitive processes, other researchers have
attempted to illuminate the physiological and brain mechanisms underlying
affective influences on human reasoning, a topic that is beyond the scope of this
chapter (for reviews, see Damasio 1994, LeDoux 1996, Cacioppo et al 1996). 


Moods and Processing Strategies


In addition to influencing memory and judgment, moods have also been found
to influence performance on a wide variety of cognitive tasks (for reviews and
different theoretical perspectives, see Clore et al 1994, Fiedler 1988, Forgas 
1995, Isen 1987, Schwarz & Clore 1996). While the findings bearing on for- 
mal reasoning tasks (such as syllogistic reasoning, puzzles, or anagrams) are 
complex and inconsistent (see Clore et al 1994), the findings bearing on social 
reasoning tasks (most notably, impression formation, stereotyping, and per- 
suasion) show a more coherent pattern. In general, individuals in a sad mood 
are more likely to use a systematic, data-driven strategy of information proc- 
essing, with considerable attention to detail. In contrast, individuals in a happy 
mood are more likely to rely on preexisting general knowledge structures, us- 
ing a top-down, heuristic strategy of information processing, with less atten- 
tion to detail. 

Consistent with the motivated tactician metaphor, these differences can 
again be traced to the informative functions of feelings (Bless et al 1996, 
Schwarz 1990, Schwarz & Clore 1996). We usually feel bad when we encoun- 
ter a threat of negative or a lack of positive outcomes, and feel good when we 
obtain positive outcomes and are not threatened by negative ones. Hence, our 
moods reflect the state of our environment, and being in a bad mood signals a 
problematic situation, whereas being in a good mood signals a benign situa- 
tion. If so, we may expect that the motivated tactician's thought processes are 
tuned to meet the situational requirements signaled by these feelings. When 
negative feelings signal a problematic situation, the individual is likely to at- 
tend to the details at hand, investing the effort necessary for a careful analysis. 
In contrast, when positive feelings signal a benign situation, the individual 
may see little need to engage in cognitive effort, unless this is required by other 
current goals. Hence, the individual may rely on preexisting knowledge
structures, which have worked well in the past, and may prefer simple heuristics
over more effortful, detail-oriented judgmental strategies. 

Numerous studies are compatible with this general perspective, as illustrated
below. Importantly, mood effects on processing style are eliminated
when the informational value of the mood is undermined (Sinclair et al 1994), 
paralleling the findings in the judgment domain discussed above. This finding 
supports the informative functions logic and is incompatible with competing 
approaches that attempted to trace mood effects on processing style to differ- 
ential influences of happy and sad moods on individuals' cognitive capacity 
(e.g. Mackie & Worth 1989; for a detailed discussion, see Schwarz & Clore 
1996). 


IMPRESSION FORMATION AND STEREOTYPING In forming an impression of 
others, we can rely on detailed information about the target person or can sim- 
plify the task by drawing on preexisting knowledge structures, such as stereo- 
types pertaining to the target's social category (Brewer 1988, Bodenhausen 
1990, Fiske & Neuberg 1990, Macrae et al 1994b). Consistent with the above 
perspective, being in a good mood has consistently been found to increase 
stereotyping (e.g. Bodenhausen et al 1994a,b), unless the target person is 
clearly inconsistent with the stereotype, thus undermining the applicability of 
the general knowledge structure (e.g. Bless et al 1996c). In contrast, being in a 
sad mood reliably decreases stereotyping and increases the use of individuat- 
ing information (for a review, see Bless et al 1996b). Across many person per- 
ception tasks, individuals in a chronic or temporary sad mood have been found 
to make more use of detailed individuating information, to show less halo ef- 
fect, to be less influenced by the order of information presentation, and to be 
more accurate in performance appraisals than individuals in a happy mood, 
with individuals in a neutral mood falling in between (e.g. Edwards & Weary 
1993, Hildebrandt-Saints & Weary 1989, Sinclair 1988, Sinclair & Marks 
1992). Similar findings have been obtained for individuals' reliance on scripts 
pertaining to typical situations (such as having dinner in a restaurant) versus
their reliance on what actually transpired in the situation (Bless et al 1996a).
Throughout, individuals in a good mood are more likely to rely on preexisting 
general knowledge structures, proceeding on a “business-as-usual” routine, 
whereas individuals in a sad mood are more likely to pay close attention to the 
specifics at hand, much as one would expect when negative feelings provide 
the motivated tactician with a problem signal. 


PERSUASION Research into mood and persuasion parallels these findings. In 
general, a message that presents strong arguments is more persuasive than a 
message that presents weak arguments, provided that recipients are motivated 
to process the content of the message and to elaborate on the arguments. If re- 
cipients do not engage in message elaboration, the advantage of strong over 
weak arguments is eliminated (for reviews, see Eagly & Chaiken 1993, Petty
& Cacioppo 1986). Numerous studies demonstrated that sad individuals are
more likely to engage in spontaneous message elaboration than happy indi- 
viduals, with individuals in a neutral mood falling in between (for reviews, see 
Mackie et al 1992, Schwarz et al 1991a). Consequently, sad individuals are 
strongly influenced by compelling arguments and not influenced by weak ar- 
guments, whereas happy individuals are moderately, but equally, influenced 
by both. Hence, a strong message fares better with a sad than with a happy 
audience, but if communicators have nothing compelling to say they had better 
put recipients into a good mood. 

Importantly, happy individuals' spontaneous tendency not to think about 
the arguments in much detail can be overridden by other goals (e.g. Wegener et 
al 1995) or explicit task instructions (e.g. Bless et al 1990, Martin et al 1993). 
What characterizes the information processing of happy individuals is not a 
general cognitive or motivational impairment, but a tendency to spontaneously 
rely on simplifying heuristics and general knowledge structures in the absence 
of goals that require otherwise, again consistent with the metaphor of the moti- 
vated tactician. 


Summary 


In summary, social cognition researchers have complemented the emphasis on 
“cold” cognition that characterized initial research under the information proc- 
essing paradigm by exploring the interplay of feeling and thinking. This re- 
search revealed pervasive effects of temporary feelings on memory, judgment, 
and reasoning strategies across a wide range of content domains. The emerg- 
ing findings highlight that our feelings serve informative functions in judg- 
ment and in the self-regulation of cognitive activity, thus linking affect, cogni- 
tion, and motivation. To date, this work has disproportionately focused on 
global moods at the expense of more specific emotions (but see Mathews & 
MacLeod 1994). Moreover, it has not addressed how the social context in 
which a given feeling is experienced may moderate its impact on cognitive and 
motivational processes. 


SOCIALLY SITUATED COGNITION 


The increased attention to “warm” cognition remained true to cognitive (so- 
cial) psychology’s focus on “inside-the-head” phenomena, while filling some 
of the more glaring gaps fostered by the computer metaphor of information 
processing models. Complementing this richer picture of inside-the-head
phenomena, cognitive social psychologists have recently rediscovered that
humans do much of their thinking in a social context. Consistent with the
motivated tactician metaphor, researchers explored how characteristics of the
social situation influence cognition and information processing strategies,
paying attention to variables such as the accountability of the actors (e.g. Tetlock
1992), their anticipation of audience reactions (e.g. McCann & Higgins 1992), 
and their interdependence and position in a power hierarchy (e.g. Fiske & 
Depret 1996). A related line of work emphasized the impact of conversational 
processes on human reasoning and traced many cognitive biases and apparent 
shortcomings of judgment documented in the laboratory to the specific nature 
of communication in research situations, suggesting that the observed biases 
may be less powerful under more natural conditions (e.g. Hilton 1995, 
Schwarz 1996). In addition, cognitive social psychologists turned to
collaborative cognition and extended previous research into group problem solving by
exploring how people collaborate on reasoning and memory tasks (e.g. 
Resnick et al 1991, Wegner 1987). Finally, the field has developed an increased
interest in cultural influences on cognitive processes and human behavior
(e.g. Markus & Kitayama 1991, Nisbett & Cohen 1996, Goldberger &
Veroff 1995; for a comprehensive review, see Fiske et al 1998).

The next two sections review selected aspects of cognitive social psycholo- 
gists’ interest in situated cognition and highlight that thought processes are 
tuned to meet situational requirements, consistent with the motivated tactician 
metaphor. 


The Motivated Tactician in Social Situations 


As noted previously, dual-process models of judgment (Chaiken & Trope 
1998) emphasize that individuals can rely on systematic processing strategies 
that involve considerable effort and attention to detail or can rely on heuristic 
processing strategies that simplify the task at hand. As implied by the moti- 
vated tactician metaphor, the choice of processing strategies is highly sensitive 
to individuals’ goals (for reviews, see Gollwitzer 1990, Hilton & Darley 1991) 
and features of the social situation, as some exemplary lines of research may il- 
lustrate. 


INTERDEPENDENCE AND POWER In the domain of person perception, individu- 
als are most likely to simplify impression formation by drawing on the target 
person's category membership at the expense of more detailed individuating 
information when little is at stake. Hence, stereotyping is the rule under stan- 
dard laboratory conditions, but the impact of stereotypes is attenuated when 
participants expect future interactions with the target person under conditions 
of interdependence (e.g. Berscheid et al 1976, Neuberg & Fiske 1987, Neuberg 
1989), independent of whether the interdependence is cooperative or competi- 
tive in nature. Fiske & Neuberg (1990; see also Brewer 1988) proposed a con- 
tinuum model of impression formation that assigns a crucial role to perceivers’ 
goals: When the relationship is important to the perceiver, the perceiver will 
allocate sufficient cognitive resources to form an individuated impression 
based on specific information about the target person; when the relationship is
unimportant, the perceiver will simplify the judgmental task by drawing on the 
target's category membership, resulting in a stereotypic judgment. 

Extending this work, Fiske and colleagues (for a review, see Fiske & Depret 
1996) have recently addressed the role of power in person perception. They ob- 
served that powerless individuals, whose outcomes depend on a more powerful 
other's actions, are particularly likely to seek the most diagnostic information 
available about the powerful other. In contrast, powerful individuals not only 
fail to seek complex information about their subordinates but have been found 
to attend more to stereotype-confirming than to stereotype-disconfirming in- 
formation, thus further increasing the impact of stereotypes. "Whereas power- 
lessness demands vigilance, power allows people to ignore the most informa- 
tive cues about others," as Fiske & Depret (1996, p. 34) concluded—at least as 
long as they are not dependent on the powerless other's actions. 


ACCOUNTABILITY AND AUDIENCE EFFECTS Much as research into impression
formation has typically ignored the interdependence of judge and target, re- 
search into social judgment and decision making has typically studied indi- 
viduals' thought processes in a social vacuum. Consequently, participants in 
judgment studies “do not need to worry about the interpersonal consequences 
of their conduct (‘How will others react if I do this? How effectively can I
justify my views if challenged?’),” as Tetlock (1992, p. 335) observed. Extending
earlier work into audience effects on cognitive processes (e.g. Zajonc 1960), 
Tetlock and his colleagues explored the impact of judges’ accountability on 
their choice of information processing strategies and conceptualized the find- 
ings in a social contingency model of judgment, which makes three key predic- 
tions (for a comprehensive review, see Tetlock 1992). 

First, when individuals have no prior commitment to a position, are ac- 
countable for their conduct, and know their constituency’s attitudes and pref- 
erences, “they simply adopt positions likely to gain the favor of those to whom 
they feel accountable” (Tetlock 1992, p. 340), essentially following an accept- 
ability heuristic. For example, individuals who have to justify their views to a 
liberal audience report more liberal opinions than those who have to justify 
them to a conservative audience. When these reports pertain to a well-known 
attitude object, the observed conformity effects at the reporting stage are usu- 
ally short lived and do not result in enduring attitude change on private meas- 
ures (e.g. Tetlock et al 1989). This is not the case, however, when the reports 
pertain to a previously unknown attitude object, such as a target person
described by the experimenter (e.g. Higgins & Rholes 1978). In this case,
tailoring one’s judgments to an audience that is assumed to like (or dislike) the
person has been found to result in private attitude change as well as biased recall
(for reviews, see Higgins 1981, McCann & Higgins 1992). In Higgins &
Rholes's (1978) study, for example, participants were asked to reproduce the
original information about the target person, and these recall data indicated 
positive or negative distortions, respectively, after participants described the 
person to a positively or negatively predisposed audience. Such modifications 
of the mental representation of the attitude object are most likely when the re- 
spective representation is not well formed at the time the individual tailors the 
report to an audience. When such modifications of the mental representation 
occur, they are likely to result in lasting effects on subsequent judgments. 

Second, when individuals have no prior commitment to a position and do 
not know their constituency's attitudes and preferences, accountability moti- 
vates them “to abandon their cognitive miserly ways and to become relatively 
flexible, self-critical and multidimensional thinkers” (Tetlock 1992, p. 340).
Under these conditions, content analyses of confidential thought protocols re- 
veal considerable thought complexity, a weighting of arguments on both sides 
and preparation for possible critical challenges, resulting in changes in the 
mental representation of the topic (e.g. Tetlock et al 1989). 

Finally, “when people have irrevocably committed themselves to a course
of action, accountability will again motivate cognitive effort.” Yet, in this
case, “the result will be rigid, defensive, and evaluatively consistent thought.
Accountability will prompt people to generate as many reasons as they can
why they are right and potential critics wrong” (Tetlock 1992, p. 340), an
observation that is consistent with earlier work into the effects of commitment on
attitude change (Kiesler 1971).

These predictions have received considerable support across a wide range 
of judgment and decision tasks in laboratory experiments and field studies (see 
Tetlock 1992). Importantly, this research also demonstrated that the more 
careful and deliberate processing evoked by being accountable to an audience 
with unknown preferences can increase as well as decrease cognitive biases 
(e.g. Tetlock & Boettger 1996), an issue addressed in more detail in the section 
on judgmental biases. 
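
The three predictions of the social contingency model, as summarized above, can be read as a simple decision rule. The sketch below is only a schematic restatement: the function name, the argument names, and the order in which the conditions are checked are assumptions made for illustration. In particular, the summary does not say whether knowledge of the audience's views matters once a person is committed, so the sketch ignores it in that case.

```python
def predicted_strategy(accountable, committed, knows_audience_views):
    """Return the processing style predicted for a judge in a given situation.

    Hypothetical helper restating the three predictions summarized in the
    text; it is not part of Tetlock's (1992) own formulation.
    """
    if not accountable:
        # The predictions summarized here concern accountable judges only.
        return "no prediction summarized here"
    if committed:
        # Third prediction: commitment plus accountability.
        return "rigid, defensive, evaluatively consistent thought"
    if knows_audience_views:
        # First prediction: adopt positions the audience is likely to favor.
        return "acceptability heuristic"
    # Second prediction: unknown audience views prompt complex thought.
    return "flexible, self-critical, multidimensional thought"


# An uncommitted judge who knows the audience's views.
example = predicted_strategy(
    accountable=True, committed=False, knows_audience_views=True
)
```

Writing the rule this way makes explicit that accountability has no single effect: the predicted processing style depends on commitment and on what the judge knows about the audience.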


MINORITY AND MAJORITY INFLUENCE In light of the motivated tactician’s 
sensitivity to the social nature of the judgment situation, we may further expect 
that perceptions of the majority or minority status of a communicator influence 
how the communicator’s message is processed. Historically, social psycho- 
logical research into influence processes has typically focused on individuals’ 
compliance and conformity with a perceived majority, consistent with the pio- 
neering work of Sherif (1936) and Asch (1955). Moscovici (1976, 1980) chal- 
lenged this tradition by drawing attention to key differences in majority and 
minority influence. According to his conversion theory, opinion majorities ex- 
ert influence by eliciting public compliance and conformity, whereas opinion 
minorities may elicit private attitude change via informational routes,
provided that they argue their point consistently and without compromise. The
empirical support for Moscovici's (1980) original proposal and its refinements 
(Mugny & Perez 1991) is mixed (for a meta-analysis, see Wood et al 1994). 
One aspect that has received particular attention in recent research is whether 
perceived majority or minority support influences the extent to which recipi- 
ents of a message engage in systematic or heuristic processing. 

In general, people assume that their attitudes and opinions are widely 
shared by members of their group [a phenomenon known as the “false consensus”
bias (Ross, Greene, & House 1977)], but not necessarily by others. When
these assumptions are called into question, either through a counterattitudinal 
message from a majority or through a proattitudinal message from a minority, 
recipients are likely to think carefully about the message. Hence, they are in- 
fluenced when the message presents strong arguments, but not when it pres- 
ents weak arguments. Moreover, this attitude change is obtained on private 
measures (thus minimizing conformity pressures) and can be predicted on the 
basis of their thoughts about the arguments. In contrast, proattitudinal mes- 
sages from a majority, or counterattitudinal messages from a minority, are con- 
sistent with recipients’ expectations, receive less thought and are less likely to 
result in private attitude change (Baker & Petty 1994, Mackie 1987). Accord- 
ingly, the extent to which a message receives systematic elaboration and elicits 
private attitude change is not a function of its majority or minority support per 
se, in contrast to the assumptions of Moscovici's (1980) conversion theory, 
nor is it solely a function of message content. Instead, message processing de- 
pends on whether the combination of message content and source violates ex- 
pectations and hence requires thought (for different perspectives, see Bohner 
et al 1996, De Vries et al 1996, Wood et al 1994). 
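
The expectancy-violation account in this paragraph amounts to a two-by-two rule over source status and message position. The following sketch restates it under simplifying assumptions: the labels and function names are hypothetical, and the two-valued outcome is a coarse stand-in for what the studies describe as more versus less thought.

```python
# What recipients are assumed to expect from each kind of source
# (illustrative labels, following the false consensus logic in the text).
EXPECTED_POSITION = {
    "majority": "proattitudinal",
    "minority": "counterattitudinal",
}


def violates_expectations(source, position):
    """True when the source takes a position recipients did not expect."""
    return EXPECTED_POSITION[source] != position


def predicted_processing(source, position):
    """Careful thought follows an expectancy violation; otherwise less thought."""
    if violates_expectations(source, position):
        return "careful"
    return "less thought"


# Enumerate all four combinations of source and position.
predictions = {
    (source, position): predicted_processing(source, position)
    for source in ("majority", "minority")
    for position in ("proattitudinal", "counterattitudinal")
}
```

On this reading neither source status nor message position alone determines processing. Only their combination does, which is the point of contrast with conversion theory.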

Note, however, that conversion theory (Moscovici 1980) emphasizes that 
minorities need to argue their point consistently over extended periods to exert 
an influence, a condition that has not been met in persuasion experiments lim- 
ited to one message presentation. Nevertheless, the conceptual framework of 
dual-process models of persuasion promises to further our understanding of 
the differential processes underlying majority and minority influence (see
Mackie & Skelly 1994), although more realistic procedures will be required to 
capture the dynamics of actual confrontation with majorities and minorities in 
face-to-face settings. 


SUMMARY As the selected lines of research illustrate, individuals' process- 
ing strategies are tuned to meet the requirements of the social situation. This 
work has extended the range of social cognition research from “thinking about 
social stimuli" to "thinking in a social context." Much of our thinking in social 
contexts, however, is intricately intertwined with conversational processes, to 
be addressed in the next section. 
Cognition and Communication 


As Krauss & Fussell (1996, p. 655) noted, “Communication is one of the pri- 
mary means by which people affect one another, and, in light of this, one might 
expect the study of communication to be a core topic of social psychology, but 
historically that has not been the case.” In fact, social psychologists not only 
have neglected communication as a key topic of inquiry, but also have failed to 
take into account how communicative processes shape thinking in a social 
context. Focusing on individuals as isolated information processors, cognitive 
(social) psychologists have documented a wide range of biases and shortcom- 
ings of judgment that violate common sense and basic logic, calling the ration- 
ality of human judgment into question (for reviews, see Fiske & Taylor 1991, 
Nisbett & Ross 1980, Ross 1977). Recent research suggests, however, that 
many of the well-known fallacies can be traced in part to the nature of 
communication in research settings and do not necessarily reflect inherent 
shortcomings of human judgment. To understand the underlying processes, we need to 
consider how the communication between researchers and research partici- 
pants differs from communication in daily life. 

As a large body of research documented (for reviews, see Clark 1985, Lev- 
inson 1983, Sperber & Wilson 1986), social discourse proceeds according to a 
“co-operative” principle (Grice 1975), which can be expressed in the form of 
four maxims. A maxim of quality enjoins speakers not to say anything they be- 
lieve to be false or lack adequate evidence for, and a maxim of relation enjoins 
speakers to make their contribution relevant to the aims of the ongoing 
conversation. In addition, a maxim of quantity requires speakers to make their 
contribution as informative as is required, but not more informative than is required, 
while a maxim of manner holds that the contribution should be clear rather 
than obscure, ambiguous, or wordy. Accordingly, “communicated informa- 
tion comes with a guarantee of relevance” (Sperber & Wilson 1986, p. vi) and 
listeners are entitled to assume that speakers try "to be informative, truthful, 
relevant, and clear"—and listeners interpret speakers' utterances "on the as- 
sumption that they are trying to live up to these ideals" (Clark & Clark 1977, p. 
122). 

These tacit assumptions are routinely violated in research situations, and 
these violations contribute to a broad range of apparent biases of judgment and 
artifacts in survey measurement, as a few examples may illustrate (for exten- 
sive reviews, see Hilton 1995; Schwarz 1994, 1996). 


THE CONVERSATIONAL RELEVANCE OF “IRRELEVANT” INFORMATION  In judgment 
research, experimenters as social communicators often introduce information 
that is neither informative nor relevant. However, participants have no 
reason to doubt the relevance of information provided to them in a serious 
research setting and are "likely to seek relevance in any experimental message" 
(Kahneman & Tversky 1982, p. 502). As a consequence, they go beyond the 
literal meaning of the utterance and treat irrelevant information as relevant, re- 
sulting in judgmental errors relative to normative models that consider only 
the literal meaning of the utterance but not the implications of the communica- 
tive context. These errors are due to violations of conversational norms on the 
part of the experimenter and are obtained under circumstances that neither 
conform to conversational norms nor allow the insight that the usual conversa- 
tional maxims do not apply. 

For example, Kahneman & Tversky (1973) provided participants with a 
description of a target person who "shows no interest in political and social issues 
and spends most of his free time on his many hobbies which include home car- 
pentry, sailing, and mathematical puzzles." This person was said to be ran- 
domly drawn from a sample of engineers and lawyers, and participants had to 
determine the person's profession. They predicted that the person is most 
likely an engineer, independent of whether the base-rate probability for any 
person in the sample being an engineer was .30 or .70. These predictions indi- 
cate that participants relied on individuating information of little diagnostic 
value at the expense of more diagnostic base-rate information, thus violating 
normative (Bayesian) models of judgment (for a review of related studies, see 
Nisbett & Ross 1980 and Ginossar & Trope 1987; for a critique of the norma- 
tive assumptions, see Gigerenzer 1991). Does this imply, however, that they 
did not note that the personality sketch provided to them was uninformative? 
Or did they draw on this information because they inferred that the researcher 
wanted them to consider it—or else, why would it be presented to them in the 
first place? An extended replication of Kahneman & Tversky’s (1973) study 
supports the latter possibility (Schwarz et al 1991c). When the personality de- 
scription was provided as a narrative allegedly written by a psychologist, par- 
ticipants again concluded that the target person is an engineer, independent of 
the base-rate of lawyers and engineers in the sample. Yet, when the same de- 
scription was presented as a random sample of information about this person, 
allegedly drawn by a computer from a larger file assembled by psychologists, 
participants relied on the more diagnostic base-rate information to make a pre- 
diction. This reflects that a narrative written by the researcher comes with the 
guarantee of relevance that characterizes cooperative communication, a guar- 
antee that does not extend to a random sample of information drawn by a com- 
puter. Hence, participants tried to make sense of the personality information 
provided to them in the former case, but were happy to ignore it in the latter 
(see also Krosnick et al 1990). 
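
The normative benchmark invoked in this paragraph can be sketched with Bayes' rule in odds form. The following is only an illustration under stated assumptions: the function name `posterior_engineer` and the likelihood ratios are hypothetical and are not taken from the studies cited; only the base rates of .30 and .70 come from the text.

```python
def posterior_engineer(base_rate: float, likelihood_ratio: float) -> float:
    """Return P(engineer | description) using Bayes' rule in odds form.

    likelihood_ratio = P(description | engineer) / P(description | lawyer).
    The ratios used below are hypothetical, chosen only for illustration.
    """
    if not 0.0 < base_rate < 1.0:
        raise ValueError("base_rate must lie strictly between 0 and 1")
    prior_odds = base_rate / (1.0 - base_rate)
    posterior_odds = prior_odds * likelihood_ratio
    return posterior_odds / (1.0 + posterior_odds)


if __name__ == "__main__":
    # Base rates from the text; likelihood ratios are assumed values.
    for base_rate in (0.30, 0.70):
        for likelihood_ratio in (1.0, 2.0):
            p = posterior_engineer(base_rate, likelihood_ratio)
            print(f"base rate {base_rate:.2f}, LR {likelihood_ratio:.1f}: {p:.3f}")
```

With a likelihood ratio of 1 (a wholly nondiagnostic description), the posterior simply equals the base rate, so a normative judge would give different answers in the .30 and .70 conditions. Even a mildly diagnostic description should leave the two conditions clearly apart, which is why identical predictions across conditions are read as base-rate neglect.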

Similar conversational analyses have been offered for other judgmental bi- 
ases that reflect reliance on normatively irrelevant information, including the 
fundamental attribution error, i.e. dispositional attributions on the basis of 
nondiagnostic behavior (e.g. Wright & Wells 1988); the dilution effect, i.e. the 
observation that nondiagnostic information attenuates the impact of more di- 
agnostic information in impression formation (e.g. Tetlock & Boettger 1996); 
the conjunction fallacy (e.g. Dulany & Hilton 1991); and misleading question 
effects in eyewitness testimony (e.g. Dodd & Bradshaw 1980). In fact, when 
explicitly asked, participants usually seem aware that the normatively irrele- 
vant information is of little informational value (e.g. Miller et al 1984). Never- 
theless, they typically proceed to use it in making a judgment because the sheer 
fact that it has been presented renders it conversationally relevant in the given 
context. Once the “guarantee of relevance” is undermined, participants are less 
inclined to see relevance in the information provided to them and the impact of 
normatively irrelevant information is attenuated (see Hilton 1995, Schwarz 
1996). Moreover, increasing individuals’ motivation to arrive at a defensible 
judgment, e.g. by making them accountable, does not attenuate reliance on 
normatively irrelevant information presented by the researcher. To the con- 
trary, it increases their efforts to find meaning in the material presented to them 
(e.g. Tetlock & Boettger 1996). 

In summary, the procedures typically used in psychological research are 
likely to result in an overestimation of the size and the pervasiveness of judg- 
mental biases. Note, however, that a conversational analysis does not imply 
that violations of conversational norms are the sole source of judgmental bi- 
ases. Like most robust phenomena, judgmental biases are likely to have many 
determinants. If we are to understand their operation in natural contexts, how- 
ever, we need to ensure that their emergence in laboratory experiments does 
not reflect the operation of determinants that are unlikely to hold in other set- 
tings. 


CONVERSATIONAL RELEVANCE AND RESPONSE EFFECTS IN SURVEY RESEARCH 
These considerations of socially situated cognition in the research context ex- 
tend beyond the psychological laboratory and apply as well to survey inter- 
views. Like participants in an experiment, survey respondents draw on the re- 
searcher's contributions to the interview to determine the meaning of the ques- 
tions posed to them. The researcher's contributions include the content of pre- 
ceding questions as well as apparently formal characteristics of the question- 
naire, such as the numeric values presented as part of a rating scale, as a few 
examples may illustrate (for reviews, see Clark & Schober 1992, Schwarz 
1996, Strack 1994, Sudman et al 1996). 

To survey researchers' concern, about 30% of the respondents of any sample 
are willing to provide opinions on highly obscure or even completely 
fictitious issues, such as the "Agricultural Trade Act of 1978" (e.g. Bishop et al 
1986, Schuman & Presser 1981). Such findings apparently document that the 
fear of appearing uninformed may induce “many respondents to conjure up 
opinions even when they had not given the particular issue any thought prior to 
the interview" (Erikson et al 1988, p. 44; Converse 1964). Yet, the sheer fact 
that a question about some issue is asked presupposes that the issue exists—or 
else asking a question about it would violate every norm of conversational 
conduct. Having no reason to assume that the researcher would ask meaning- 
less questions, respondents turn to the context of the ambiguous question to 
make sense of it, much as they would be expected to do in any other conversa- 
tion. Once they assign a particular meaning to the issue, they may have no dif- 
ficulty reporting a subjectively meaningful opinion. Supporting this reasoning, 
Strack et al (1991) observed that German university students favored the intro- 
duction of a fictitious “educational contribution” when a preceding question 
pertained to fellowships for students but opposed it when it pertained to 
tuition. Open-ended responses confirmed that they used the content of the 
preceding question to determine the nature of the fictitious issue. While findings 
of this type (see Schwarz 1996) provide the comforting assurance that respon- 
dents do not conjure up opinions to avoid appearing uninformed, they again 
highlight the extent to which insensitivity to conversational processes may 
foster misleading conclusions about human cognition and behavior. 

In addition to drawing on the content of preceding questions, respondents 
make systematic use of apparently formal features of questionnaire design in 
determining question meaning. For example, when asked how successful they 
have been in life, on a scale ranging from “not at all successful” to “extremely 
successful,” respondents have to determine what the researcher means by “not at 
all successful.” Does this term refer to the absence of great achievements or to 
the presence of explicit failures? Empirically, they adopt the former interpreta- 
tion when the numeric values of the rating scale run from 0 to 10, but the latter 
when the values run from -5 to +5, resulting in markedly different responses 
(Schwarz et al 1991b). In general, a scale format that runs from negative to 
positive numeric values indicates that the researcher has a bipolar dimension 
in mind (e.g. presence of failure to presence of success), whereas the use of 
only positive values indicates a unipolar dimension (e.g. different degrees of 
success). Similar considerations apply to the influence of other response for- 
mats, including differences between open and closed question formats, the use 
of “don’t know” filters, and the impact of frequency scales on behavioral re- 
ports, all of which can be fruitfully conceptualized in a conversational infer- 
ence framework (Clark & Schober 1992, Schwarz 1996). Throughout, what 
appear as artifacts of meaningless formal features of questionnaires can be 
traced to respondents' systematic use of contextual information in an attempt 
to determine the intended meaning of the question and the nature of their task. 


SUMMARY As these examples illustrate, many apparent biases in social judg- 
ment and artifacts in survey measurement do not necessarily reflect inherent 
shortcomings of human judgmental processes. Instead, they reflect that 
research participants are cooperative communicators who draw on the re- 
searcher's contributions to the ongoing conversation in an attempt to provide 
useful information. Unfortunately, they miss one crucial point: Although the 
researchers are likely to observe conversational norms in all other situations, 
they may violate them in the research context by deliberately presenting in- 
formation that is irrelevant to the task at hand or by inadvertently introducing 
features that carry information of which they are not aware. Unless we learn 
to take the information conveyed by our research procedures into account, we 
run the risk of painting an inappropriately negative picture of human judg- 
ment. 


The Motivated Tactician's Limits: Automaticity, Implicit 
Cognition, and Mental Control 


The research reviewed so far highlighted individuals' ability to flexibly tune 
their thought processes to the requirements at hand. Yet “you can’t always 
think what you want" (Wegner 1992, p. 193), and a review of recent develop- 
ments in cognitive social psychology would be incomplete without drawing at- 
tention to social psychologists' increasing interest in automatic, nonconscious 
influences on thought and behavior and the limits of mental control (for com- 
prehensive reviews, see Bargh 1994, Greenwald & Banaji 1995, Wegner & 
Wenzlaff 1996, Wegner & Bargh 1998). 


AUTOMATICITY AND IMPLICIT COGNITION  Implicit cognitions are "intro- 
spectively unidentified (or inaccurately identified)" traces of past experiences 
that influence judgments or behaviors (Greenwald & Banaji 1995, p. 5). Pre- 
dating interest in this domain by several decades, Zajonc (1968) demonstrated, 
for example, that repeated exposure to an unknown stimulus (e.g. a Chinese 
ideograph) increases liking for the stimulus. Importantly, effects of previous 
exposure on liking are obtained under conditions where participants cannot 
recognize the stimulus as having been previously presented (Kunst-Wilson & 
Zajonc 1980) and are eliminated when participants are aware of the previous 
exposure (Bornstein & D’Agostino 1992). Effects of this type reflect that 
previous exposure leaves memory traces that increase the fluency with which the 
stimulus can be processed later (Jacoby 1989), and that the experience of per- 
ceptual fluency can itself serve as a basis of judgment (e.g. Reber et al 1998). 
Such dissociations between "explicit memory" (what one is able to recall or 
recognize) and “implicit memory" (memory traces inferred from an impact of 
the learning experience on other measures) have received considerable atten- 
tion in cognitive psychology (see Jacoby & Kelley 1987 and the contributions 
in Reder 1996). Extending this work, social psychologists have begun to 
explore the role of such unconscious influences in judgment and behavior 
(Greenwald & Banaji 1995). 

An unconscious influence of particular interest is the automatic activation 
of stereotypes (e.g. Banaji et al 1993, Devine 1989), which may affect indi- 
viduals' behavior outside of awareness. For example, Bargh et al (1996) asked 
participants to form meaningful sentences from strings of words that contained 
elements related to the elderly stereotype, like “forgetful,” "Florida," or 
"bingo." Although none of these words was explicitly related to the concept of 
“slowness,” this concept, which is part of the general elderly stereotype, was 
apparently activated and influenced participants’ behavior: After ostensible 
completion of the experiment, participants primed with the elderly stereotype 
walked more slowly to the elevator than participants exposed to other con- 
cepts. A growing body of research illustrates the role of such automatic and un- 
conscious influences in everyday life (for a review, see Bargh 1997) and their 
controllability and interplay with conscious cognition is the topic of consider- 
able debate (see the contributions in Wyer 1997). 


MENTAL CONTROL  Importantly, research in this domain not only docu- 
mented pervasive effects of automatically activated knowledge structures, but 
also suggested that attempts to avoid unwanted influences of knowledge acti- 
vation may often backfire. Suppose that a person anticipates an interaction 
with a skinhead and wants to avoid any undue influences of stereotypic knowl- 
edge about the group on the impression formed of the exemplar. To accom- 
plish this, the person may attempt to suppress the stereotype, essentially trying 
not to think “skinhead.” Unfortunately, this effort is likely to be of limited 
success, as Macrae et al (1994a) demonstrated. Although we can suppress 
unwanted thoughts for a limited period, we are only successful at doing so when 
no other task taxes our cognitive resources. When we are distracted by another 
task, or terminate our efforts when no longer required, the unwanted thoughts 
bounce back with renewed vigor (for reviews, see Wegner & Wenzlaff 1996, 
Wegner & Bargh 1998, and the contributions in Wegner & Pennebaker 1993). 
These rebound effects reflect that thought suppression involves a monitoring 
process that checks if the unwanted thoughts intrude into consciousness. Ironi- 
cally, this monitoring process increases the activation of the unwanted mate- 
rial in memory. Consequently, the unwanted material becomes highly accessi- 
ble, and we may end up thinking, and doing, exactly what we wanted to avoid 
when the cognitive resources required for suppression are taxed. 


SUMMARY As these examples illustrate, a growing body of research demon- 
strates pervasive unconscious influences of automatic knowledge activation 
on judgment and behavior and highlights the limits of intentional mental con- 
trol. At present, the interplay of conscious and unconscious processes in social 
judgment and behavior is not well understood, but its exploration is likely to be 
a focal area of social cognition research in the near future. 


CONCLUSIONS 


In sum, cognitive social psychology complemented its previous emphasis on 
“cold” cognition and individuals as isolated information processors with a re- 
newed interest in the interplay of feeling and thinking and the exploration of 
social influences on human cognition. The picture that emerges from these 
developments is compatible with social psychology's latest metaphor, humans as 
motivated tacticians (Fiske & Taylor 1991) who pragmatically adapt their 
processing strategies to the requirements of the situation at hand in an effort to 
get things done. Hence, they pay close attention to complex information and 
engage in systematic and effortful processing strategies when required, but 
rely on cognitive shortcuts, simple heuristics, and preexisting knowledge 
structures in the perceived absence of such requirements; and they engage in 
rigid and defensive processing once committed to a position or course of ac- 
tion (Gollwitzer 1990, Tetlock 1992). To determine the requirements at hand, 
they attend to a wide range of internal and external cues. They are more likely 
to pay attention to the details at hand when negative feelings signal a problem- 
atic situation and to rely on general knowledge structures and simple heuristics 
when positive feelings signal a benign situation (Schwarz & Clore 1996). 
Similarly, they attend to complex information about others when the other is 
powerful and may affect their outcomes, but not otherwise (Fiske & Depret 
1996), and they consider the likely interpersonal consequences of their judg- 
ments and decisions when they are held accountable, but not otherwise (Tet- 
lock 1992). The conditions under which the motivated tactician is most likely 
to get it wrong are conditions under which normally adaptive assumptions are 
thwarted, as is the case when researchers present irrelevant information under 
circumstances that suggest otherwise (Hilton 1995, Schwarz 1996). Yet, this 
emphasis on the pragmatic and adaptive nature of social thinking needs to be 
balanced by a recognition of the pervasive role of unconscious influences on 
social judgment and behavior (Bargh 1997, Greenwald & Banaji 1995), which 
are likely to be more fully understood in the near future. 

Depending on the normative perspective taken, we may lament the errors 
that the motivated tactician makes or praise the flexible adaptation of process- 
ing strategies to the individual's goals. Over the past decades, social psycholo- 
gists have embraced Bayesian probability theory and related normative mod- 
els as the gold standard against which human judgment is to be evaluated, and 
found human judgment lacking (for different perspectives, see Nisbett & Ross 
1980; Gigerenzer 1991, 1996; Kahneman & Tversky 1996). In recent years, 
the field has increasingly adopted a more pragmatic perspective (e.g. Fiske 
1992, 1993; Funder 1987). 


Before labeling an effect a cognitive flaw, one should consider (1) the 
interpersonal, institutional, and political goals that people are trying to achieve by 
making judgments of a particular type (e.g. Do people seek to achieve causal 
understanding or to express their moral approval/disapproval? Do people 
attempt to maximize expected utility or to minimize risk of serious criticism?) 
and (2) whether cognitive strategies that serve people well in everyday life 
may lead them seriously astray in laboratory experiments on judgment and 
choice. (Tetlock 1992, p. 360) 


The accumulating evidence indicates that the fallibility of human judgment 
is less detrimental than normative analyses would suggest and that people may 
be “good enough" (Fiske 1992) thinkers to get things done, true to William 
James's (1890, p. 333) credo that “my thinking is first and last and always for 
the sake of my doing.” 
ABSTRACT 


There has been rapid growth in the study of diffusion across organizations 
and social movements in recent years, fueled by interest in institutional 
arguments and in network and dynamic analysis. This research develops a 
sociologically grounded account of change emphasizing the channels along which 
practices flow. Our review focuses on characteristic lines of argument, em- 
phasizing the structural and cultural logic of diffusion processes. We argue 
for closer theoretical attention to why practices diffuse at different rates and 
via different pathways in different settings. Three strategies for further 
development are proposed: broader comparative research designs, closer 
inspection of the content of social relations between collective actors, and 
more attention to diffusion industries run by the media and communities of 
experts. 


What we really need is some new heroes in Engineering. I took that word 
from Deal’s culture book, and I’m trying to identify the Engineering heroes. 
Divisional Manager (Kunda 1992, p. 100) 


They are making more out of this culture stuff than it's worth. I never read 
that stuff, maybe see it in passing. It's the same nauseating stuff they print in 
Business Week. 

Group Manager (Kunda 1992, p. 180) 
INTRODUCTION 


As the above quotations suggest, skillful players in business and other arenas 
display a keen sense of fashions and movements within their spheres of action. 
Much as academics are aware of intellectual currents and exemplars in their 
fields, we may be confident that executives know what new developments are 
hot and which are not, and that political activists are attuned to successes and 
disappointments elsewhere. And as the quotes emphasize, individuals counter 
as well as endorse and employ the cultural materials provided by a larger sys- 
tem of discourse. 

Diffusion studies work with this awareness and its consequences by exam- 
ining how practices spread. They provide an opportunity to locate and docu- 
ment social structure, where we consider how patterns of apparent influence 
reflect durable social relations. And they provide an opportunity to observe the 
cultural construction of meaning, where we learn how practices are locally and 
globally interpreted, and ask why some practices flow while others languish. 

This review treats contemporary uses of diffusion arguments within the 
fields of organizations and social movements. Diffusion imagery, models, and 
explanations are on the rise in both fields and with clearly productive effect. 
We seek to map the logic of these developments, emphasizing characteristic 
lines of argument, methods, and research designs. At the same time, we strike a 
cautionary note, arguing that theoretical advance requires closer attention to 
both structural and cultural bases of diffusion. 


CONCEPTUAL OVERVIEW 


Diffusion refers to the spread of something within a social system. The key 
term here is "spread," and it should be taken viscerally (as far as one's con- 
structionism permits) to denote flow or movement from a source to an adopter, 
paradigmatically via communication and influence. We use the term “prac- 
tice" to denote the diffusing item, which might be a behavior, strategy, belief, 
technology, or structure. Diffusion is the most general and abstract term we 
have for this sort of process, embracing contagion, mimicry, social learning, 
organized dissemination, and other family members. 

The term "diffusion" is sometimes used in an alternative sense to denote 
increasing incidence: Something diffuses when more and more people do it. But 
treatment of diffusion as an outcome makes it uninteresting, since practices 
rise and fall in frequency for every possible reason. We thus focus on diffusion 
as a kind of causal process and seek to map some major lines of argument and 
important findings. 

Diffusion arguments cannot be segregated easily from other causal 
dynamics. They verge on the one hand toward models of individual choice, since 
diffusion models often treat the adopter as a reflective decision-maker. They 
verge on the other hand toward a broader class of contextual and environ- 
mental processes, where conditions outside the actor shape behavior. While it 
is easy to see when one has strayed much too far (analyses of the diffusion of 
puberty or the diffusion effects of gender composition on job satisfaction), 
useful hard and fast rules are not readily apparent. 

Rather than patrol the boundaries, we focus attention on lines of research 
with affinities to the core notions underlying diffusion. These include models 
that attend explicitly to flows of material along social relations, efforts of ex- 
ternal change agents to promote adoption, and interpretive work aligning 
sources and adopters. The emphasis is on processes treated as involving 
meaningful behavior on the part of both source and adopter.¹ 


Classical Diffusion Studies 


All lines of argument have empirical fields of application to which they are 
particularly suited. The home territory of diffusion is the innovation. Innova- 
tions are novel (at least to the adopting community), making communication a 
necessary condition for adoption. Innovations are also culturally understood as 
progressive, strengthening the hand of change agents. And since innovations 
are risky and uncertain, adopters carefully weigh the experience of others 
before acting. The elective affinity between diffusion and innovation is so strong 
that we sometimes think of diffusion as the only causal process underlying the 
adoption pattern of innovations. 

Diffusion studies thus generally investigate the introduction and adoption 
of an innovation. Classic studies include Ryan & Gross's (1943) analysis of 
the diffusion of hybrid corn, Hagerstrand's (1967) investigation of the 
diffusion of innovations such as the telephone and tests for tuberculosis involving 
the destruction of cattle in rural Sweden, and Coleman et al’s (1966) analysis 
of the diffusion of a prescription drug in four Midwestern cities. 

These studies focused directly on communication processes and channels, 
tracing the role of the mass media, professional change agents, and interper- 
sonal interaction within the adopting community. Adoption patterns and self- 
reports pointed to the impact of external sources in introducing the innovation 
to cosmopolitans, and the cascading of adoption via relational networks within 
communities [most famously in Katz & Lazarsfeld’s (1944) two-step flow of 
influence]. Relative innovativeness was explained largely by modern values 
and institutional markers of this orientation such as educational background, 
probably because the acceptance of modern, scientific practices was at issue. 
Rogers (1995) authoritatively reviews this literature. 


¹A quite different theoretical orientation would be “practice-centric,” attending to the flow of 
resourceful practices across a landscape of carriers. 
Contemporary "Macro" Diffusion Research 


Diffusion arguments go in and out of style in sociology as in other disciplines. 
There is the greatest continuity in interpersonal studies of contagion and influ- 
ence, but even here their fortunes are tied to relevance to empirical problems. 
For example, efforts to model the spread of HIV/AIDS have generated much 
important diffusion research (see the 1995 special issue of Social Networks). 
Interest in diffusion processes is also a function of broader intellectual move- 
ments, such as the role of social science in supporting the spread of modernizing 
innovation. 

In this review we treat not the rich contemporary literature on interpersonal 
influence but instead the recent development of more “macro” diffusion analy- 
sis in two fields: social movements and organizations. In the study of social 
movements, views of contagion as the irrational, spontaneous transmission of 
antisocial behavior (LeBon 1897, Tarde 1903, Kornhauser 1959) have given 
way to nuanced studies of diffusion as reflecting “normal learning and influ- 
ence processes as mediated by the network structures of everyday life" (Mc- 
Adam 1995, p. 231). Diffusion processes play a central role in contemporary 
explanations of the incidence of collective action and the spread of protest 
symbols and tactics.² 

Diffusion arguments also flourish when there is theoretical attention to the 
larger environment, to the way cultural models condition behavior, and to his- 
torical context and change rather than comparative statics. The new institu- 
tionalism (Powell & DiMaggio 1991) has precisely these emphases, and much 
diffusion research emerges in organizational studies where this school is most 
influential. Institutional lines of argument also appear in the social movement 
literature, as does network imagery in organizational research, so that diffu- 
sion studies in the two fields are fairly strongly connected. 

Diffusion research in these fields differs in obvious ways both from the 
classics of the genre and from current work on interpersonal diffusion. Con- 
temporary work on organizations and social movements typically examines 
the spread of behavioral strategies and structures rather than technical innova- 
tions, emphasizes adoptions by social collectivities more than individuals 
within those collectivities, works with a much larger historical and spatial can- 
vas, and incorporates diffusion as one sort of explanation rather than as the 
overarching framework. As one example, Fligstein (1985) evaluates five theo- 
ries of the rise of the multidivisional form across the nation's largest firms over 
the twentieth century, one of which involves imitation. 


2The literature on recruitment to activism also emphasizes the effects of network ties. See 
Curtis & Zurcher (1973), Snow et al (1980), McAdam (1982, 1988), Morris (1984), McAdam & 
Paulsen (1993), and McCarthy (1996). 
Given this context, contemporary diffusion research on social movements 
and organizations can learn from the classics but should not blindly copy them. 


INITIAL ELEMENTS OF A DIFFUSION ARGUMENT 


We briefly flag two important concerns that play a role in all kinds of diffusion 
arguments but that for present purposes are treated contextually rather than 
within our main story line. 


What Is Observed? 


While most diffusion research emphasizes that adopters are influenced by
immediate or secondhand observation of the diffusing practice, there is often 
much ambiguity about what is actually observed. Sometimes we treat the po- 
tential adopter as exposed to the practice itself. This involves discovering that 
something is possible, witnessing it in action, or hearing secondhand about its 
objectives, rationale, and operation. For example, executives may come into 
contact with poison pills when they sit on the boards of other firms that have 
instituted them (Davis 1991), managers may learn which markets leading 
firms enter (Haveman 1993), and activists in Switzerland may hear about pro- 
tests in the Netherlands (Kriesi et al 1995, Chapter 8). 

A potential adopter may also observe the consequences of a practice. To 
continue the above examples, one might measure contact with companies that 
had successfully warded off takeovers by wielding the pill, or calculate rates of 
return for firms that enter various markets, or contrast situations in which pro- 
tester demands were met to those in which they were not. 

The contrast between observing practices and observing their outcomes is 
tied only loosely to a contrast between diffusion as mimicry and diffusion as 
social learning. One can readily motivate diffusion in rational choice-theoretic 
terms even when no information about consequences is provided (Banerjee 
1992). And consequences may be implicit in descriptions of the practice or un- 
interpretable without close local knowledge or a good theory. 

Research that directly measures the consequences of adoption elsewhere 
suggests that both are salient. Conell & Cohn (1995) find that French coal min- 
ing strikes were stimulated by other strikes in the same department but most 
strongly by victorious ones. And Holden (1986) shows that hijacking attempts 
were stimulated by prior hijackings, especially when a ransom was paid. 

In most studies, however, these distinctions are not or cannot be made. We 
typically know that potential adopters are brought into contact with the diffus- 
ing practice but do not know quite what they see, particularly whether they ob- 
serve results. This inability to specify what is observed produces some theo- 
retical fuzziness about the microprocesses involved in diffusion. 
Innovativeness 


We also flag the issue of innovativeness, a topic that forms the flip-side of dif- 
fusion studies (see Kimberly 1981, Drazin & Schoonhoven 1996 for excellent 
discussions of the organizational literature). Innovation research asks what 
makes organizations capable of devising or adopting new technologies and 
practices.3 

While some critics have regarded the literature as beyond interpretation 
(Downs & Mohr 1976), fairly consistent findings emerge (Damanpour 1991). 
Large, technically specialized organizations with low levels of formalization 
and centralization tend to innovate rapidly (Burns & Stalker 1961). Exposure 
to external competition and rapidly shrinking markets provide external spurs 
to innovation (for example, Osterman 1992, Studer-Ellis 1997). Internally, the 
adoption of new practices requires the active efforts of innovation champions 
and a robust coalition for change. 

These lines of inquiry are relevant to diffusion analysis but ambiguously so, 
since they conflate openness to diffusion with internal inventiveness. In addi- 
tion, diffusion studies tracking specific practices must attend to the congru- 
ence between adopter and practice at least as much as generalized innovative- 
ness. Large, technically complex organizations may be quick to adopt innova- 
tions designed to handle information overload (Burns & Wholey 1993) but 
slow to adopt other practices such as “beer bash Fridays.” 

And while generalized innovativeness and particular congruences help us 
explain relative adoption rates of specific practices, neither contributes funda- 
mentally to a theoretical analysis of diffusion. For that, we must examine com- 
munication and influence within the community where practices diffuse. 


SOURCES AND STRUCTURAL MECHANISMS 


Diffusion studies are rich in structural mechanisms: characteristic relations be- 
tween source and adopter that promote diffusion. Conceptual work in the area 
tends to bring previously overlooked pathways and logics into sharp focus. 
Among the classics of this genre are Granovetter’s “The Strength of Weak 
Ties” (1973) and DiMaggio & Powell’s “The Iron Cage Revisited” (1983).4 

The discussion builds from perhaps the most central opposition: diffusion 
into a population (external source or broadcast models) vs diffusion within a 
population (internal or contagion models). The two may operate in tandem, as 


3The social movement literature has been much less concerned with variability in 
innovativeness, though Tilly (1978) and Tarrow (1994) emphasize a long historical evolution 
toward more flexible repertoires of contention. 

4DiMaggio & Powell's discussion of homogenizing processes may be read as a conceptual 
mapping of diffusion mechanisms. Their account of coercive, mimetic, and normative sources of 
homogeneity intersects at many points with our discussion. 
when people heard of John Kennedy's assassination on the radio and ran out into 
the streets to tell their neighbors. But internal and external sources often play 
different roles in a diffusion analysis and imply different adoption trajectories. 
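
The contrast in adoption trajectories can be made concrete with a small numerical sketch. The code below is an illustration under stated assumptions, not a model taken from any study reviewed here: it assumes a standard mixed-influence formulation in which each non-adopter's per-period hazard is a constant external term plus an internal term proportional to the share of prior adopters. The function name and all parameter values are hypothetical.

```python
def adoption_curve(population, external_rate, internal_rate, periods):
    """Cumulative adopters per period under an assumed mixed-influence model.

    Each period, a non-adopter adopts with hazard
        external_rate + internal_rate * (share of prior adopters).
    The calculation uses expected values, so it is deterministic.
    """
    adopters = 0.0
    curve = []
    for _ in range(periods):
        hazard = external_rate + internal_rate * adopters / population
        adopters += min(1.0, hazard) * (population - adopters)
        curve.append(adopters)
    return curve


def increments(curve):
    """New adopters in each period, derived from a cumulative curve."""
    return [later - earlier for earlier, later in zip([0.0] + curve[:-1], curve)]


# Hypothetical parameter values, chosen only to make the two shapes visible.
# Pure broadcast: adoption depends on the external source alone.
broadcast = adoption_curve(1000, external_rate=0.10, internal_rate=0.0, periods=30)
# Near-pure contagion: a tiny external rate supplies the first adopters.
contagion = adoption_curve(1000, external_rate=0.001, internal_rate=0.5, periods=30)

if __name__ == "__main__":
    for t in (0, 9, 19, 29):
        print(t + 1, round(broadcast[t]), round(contagion[t]))
```

Under these assumptions, the broadcast curve adds fewer new adopters every period (a concave trajectory), while the contagion curve starts slowly, accelerates as prior adopters accumulate, and then flattens as the pool of non-adopters is exhausted (an S-shaped trajectory).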


External Sources 


The key external sources in classic diffusion research were mass media outlets 
like the newspaper, TV, and radio, and change agents such as the Farm Bu- 
reau's extension agent and the pharmaceutical company's detail man. Contem- 
porary analyses of diffusion in organizations and social movements point to 
the same kinds of sources, often viewed more collectively (for example, ef- 
fects of the national business press or the legal community). 


MASS MEDIA The mass media play a crucial role in amplifying and editing 
the diffusion of collective action, and much protest today is organized around 
that fact. Spilerman (1976) explains the temporal clustering of urban riots in 
the 1960s by arguing that television drew national attention to riots in Newark 
and Watts, creating a “black solidarity that transcended bounds of communi- 
ties" (p. 790). Oberschall (1989) argues that the sit-in tactic diffused via the 
mass media: Students watched what other students were doing on the news and 
then staged their own sit-ins. Koopmans (1993) points out that the news media 
do much of the job of social movement organizers during periods of height- 
ened mobilization and conflict. 

The business media broadcast the stories of corporate heroes, depict best 
practice, and advertise managerial innovations and strategies. The business 
press introduces new innovations with glowing reports and later critiques both 
adopter and practice as faddish (Abrahamson & Fairchild 1997, Strang 1997). 
High levels of media attention speed the introduction of innovations like matrix 
management (Burns & Wholey 1993) and prompt mergers and acquisitions 
(Haunschild & Beckmann 1997) by providing information that complements 
that garnered via interorganizational ties. 


CHANGE AGENTS Much recent organizational analysis treats the state and the 
professions as change agents that spread new practices and facilitate particular 
lines of innovative action. State policy instruments range from coercive man- 
dates to cheerleading and often form a complex balance of the two. For exam- 
ple, Baron et al (1986) trace the diffusion of modern personnel practices to the 
mandates and infrastructure introduced by the state during World War II. Leg- 
islation on equal rights and affirmative action motivated personnel practices 
that build internal labor markets (Dobbin et al 1993), and weak federal spon- 
sorship of HMOs precipitated state legislation and shifts in HMO population 
dynamics (Strang & Bradburn 1993). 

The professions and other occupational communities form an allied source of 
new practices. They frequently mediate legal and policy imperatives: Lawyers 
construct recipes for meeting ambiguous mandates for affirmative action (Edel- 
man 1990, 1992), which human resources professionals translate into standar- 
dized procedures (Sutton & Dobbin 1996). The accounting profession devises 
and disseminates organized responses to changing IRS regulations (Mezias 1990). 

Other communities of experts operate more autonomously in the market for 
corporate efficiency. In the 1980s, organizational consultants and scholars in- 
terpreted Japanese business practice for the American manager (Ouchi 1981, 
Pascale & Athos 1981), and management faculty taught MBAs the virtues of 
the multidivisional form (Palmer et al 1993). Business consultants also devise 
and market innovations, from how to become personally effective (Covey 
1989) to how to restructure organizations (Hammer & Champy 1994). Expert 
communities are internally organized and differentiated, most notably in the 
way academics enter the fray after the battle is over (Strang 1997) and move 
toward the arguments of practitioners (Barley et al 1988). 

In social movements, experts cannot be distinguished so easily from adopt- 
ers, as activists move seamlessly across the two roles. But it is clear that strate- 
gies and tactics are often imported into local settings. Morris (1981) and Mc- 
Adam (1988) discuss the role of nonviolence workshops and training sessions 
conducted by outside activists in the civil rights movement. And many move- 
ments draw inspiration from social movement gurus such as Gandhi or Edward 
Abbey (whose book The Monkey Wrench Gang promoted controversial tactics 
like tree-spiking to halt the cutting of timber). 


Internal Influence 


Internal diffusion processes operate via information and influence flowing 
within the adopting population. Most often, especially in formal models, the 
flow is assumed to move grapevine-like from prior to potential adopters. This 
process focuses attention on interaction networks as the conduits of diffusion. 

Classical formal models of intrapopulation diffusion also assume spatial 
homogeneity, where all members of the population have the same chance of 
affecting and being affected by each other. But few substantive arguments 
work this way. Instead, sociologists take advantage of intrapopulation diffu- 
sion to search for and document social structure. 


COHESION THROUGH STRONG TIES The classic emphasis in analyses of face- 
to-face interaction treats influence as flowing along the lines of close social re- 
lations. Frequent interaction engenders much exchange of information about 
the character, motivations, and effects of diffusing practices. Particularly 
when organized by homophily, strong ties lead actors to take the perspective of 
the other and to exert powerful pressures for conformity.5 Balance theoretic 


5Some may recall the often stifling character of these pressures in the setting of the small town; 
others find a more compelling parallel in the atmosphere of the university department. 
notions (Heider 1946) and their generalizations predict homogeneity within 
cliques (Davis 1967). 

Some of these ideas surface in discussions of the benefits of strong, dense 
networks for organizing collective action. For example, Morris's (1981) ac- 
count of the diffusion of protest tactics in the civil rights movement points to 
the strong and durable relationships linking black churches, colleges, and 
movement organizations (such as the Southern Christian Leadership 
Conference). Mizruchi (1992) finds that corporations that constrain the profits 
of another firm also tend to influence the other firm's political behavior. 

Analyses of organizational cultures and internal decision-making offer par- 
allel accounts. In particular, Friedkin (1984, 1996) combines direct and short in- 
direct paths to produce measures of structural cohesion. The social circles that 
emerge from this approach locate regions of consensus on controversial policy 
issues. 


NEWS THROUGH WEAK TIES Granovetter (1973) suggests that new informa- 
tion may travel via weak ties rather than strong ones. The argument is that 
strongly related partners share many ties to third parties and so have little new 
to report to each other, while the social circles of weakly tied actors overlap 
less. Presumably the channel capacity of a weak tie is more restricted, how- 
ever, making it a conduit for news rather than resocialization. 
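
The overlap argument can be illustrated with a toy calculation. The sketch below is our own illustration rather than Granovetter's procedure: it assumes that the share of third parties two actors have in common (a Jaccard overlap) is a reasonable stand-in for how little new information a tie can carry. The network and the function name are hypothetical.

```python
def neighbor_overlap(network, a, b):
    """Jaccard overlap of a's and b's contacts, excluding a and b themselves."""
    contacts_a = network[a] - {a, b}
    contacts_b = network[b] - {a, b}
    union = contacts_a | contacts_b
    if not union:
        return 0.0
    return len(contacts_a & contacts_b) / len(union)


# Hypothetical toy network: two tight circles (a, b, c and d, e, f)
# joined by a single bridging tie between c and d.
network = {
    "a": {"b", "c"},
    "b": {"a", "c"},
    "c": {"a", "b", "d"},
    "d": {"c", "e", "f"},
    "e": {"d", "f"},
    "f": {"d", "e"},
}

if __name__ == "__main__":
    print("within circle:", neighbor_overlap(network, "a", "b"))
    print("bridging tie:", neighbor_overlap(network, "c", "d"))
```

In this toy network the tie inside a circle connects actors who share all of their other contacts, whereas the bridging tie connects actors who share none, which is why the bridge is the likelier conduit for news.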

The well-documented role of interlocking directorates in organizational diffu- 
sion may perhaps be best understood as analogous to a weak interpersonal tie 
(though they are often discussed under the rubric of cohesion). These structures 
permit “business scan” (Useem 1984), as top managers gain a glimpse of what 
other firms do. For example, firms are more likely to adopt poison pill defenses 
against hostile takeover (Davis 1991), to adopt the multidivisional form (Palmer 
et al 1993), and to engage in takeover efforts (Haunschild 1993) if their managers 
sit on the boards of firms that have previously engaged in these activities. 

The analogy to Granovetter's weak ties is not entirely apt, since board inter- 
locks familiarize executives with novel strategies more than inform them of their 
existence (Davis 1991). But it seems implausible that board interlocks produce a 
parallel to the mutual socialization produced by cohesive interpersonal relations. 
Overall, board interlocks appear a relatively thin sort of linkage important for 
the flow of information about “high” corporate strategy (for example, mergers, 
CEO compensation, and prestigious innovations such as massive downsizing), 
but they are less relevant to other kinds of organizational innovations. 

In the study of social movements, collective action often diffuses via weak 
ties carrying the news of what others have done. Rude (1964) points to the dif- 
fusion of collective action along transportation routes in England and France 
between 1730 and 1848, where travelers carried the news. Skinner (1964) de- 
tails the intervillage networks facilitating peasant rebellions in China. 
Bohstedt & Williams (1988) argue that market networks facilitated the spread 
of food riots across Devonshire in the late eighteenth century. And Gould 
(1991) shows how weak ties among Parisian neighborhoods helped mobilize 
support for the Paris Commune. 


STRUCTURAL EQUIVALENCE AND COMPETITION Burt (1987) argues that 
structurally equivalent actors (those possessing similar ties to others) attend 
carefully to each other. He motivates the argument via a logic of competition: 
We keep up with the Joneses because we cannot afford to fall behind, most im- 
portantly in managing our mutual relation to the Smiths. As Friedkin (1984) 
observes, however, apparent diffusion via structural equivalence may repre- 
sent the effects of similar patterns of contact with third parties. 
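
A minimal sketch may help fix the idea of structural equivalence. It assumes a simple undirected network and scores a pair of actors by the number of third parties tied to exactly one of them, so that a score of zero means identical ties to others. This is a simplification chosen for illustration (published analyses typically use continuous distance measures), and the firms and function name are hypothetical.

```python
def equivalence_distance(ties, a, b):
    """Count third parties tied to exactly one of a and b (0 = equivalent)."""
    others_a = ties[a] - {a, b}
    others_b = ties[b] - {a, b}
    return len(others_a ^ others_b)


# Hypothetical example: firm_x and firm_y never deal with each other
# but share the same bank and the same supplier.
ties = {
    "firm_x": {"bank", "supplier"},
    "firm_y": {"bank", "supplier"},
    "firm_z": {"bank", "customer"},
    "bank": {"firm_x", "firm_y", "firm_z"},
    "supplier": {"firm_x", "firm_y"},
    "customer": {"firm_z"},
}

if __name__ == "__main__":
    print(equivalence_distance(ties, "firm_x", "firm_y"))
    print(equivalence_distance(ties, "firm_x", "firm_z"))
```

The example also shows why the caveat about third parties matters: firm_x and firm_y are equivalent precisely because they share the same contacts, so adoption in tandem could reflect either mutual monitoring or common exposure to the bank and the supplier.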

Reanalyses of Coleman et al's Medical Innovation find that structurally 
equivalent doctors tend to adopt in tandem (Strang & Tuma 1993, Burt 1987). 
Galaskiewicz & Burt (1991) show that structurally equivalent pairs of corpo- 
rate loan officers had closely aligned perspectives on local charities. And 
Mizruchi (1992) finds indirect interlocks to financial institutions a strong pre- 
dictor of similar political contributions (though as Mizruchi notes, this may be 
interpreted as bank influence). 

More prosaic forms of competition also generate mimicry. Much evidence 
suggests that firms in competition are highly responsive to each other's efforts 
at innovation. Japanese managerial and production practices diffused most 
quickly to firms exposed to external competition (Osterman 1992). Firms 
mimic those in their industry (Fligstein 1985, 1990), and states mimic the policies 
of other states (Zhou 1993). In the social movements arena, Tarrow (1989a) argues 
that competition between protest organizations drives the diffusion of 
disruptive tactics as groups seek to outbid each other. 

But these examples suggest that while competition often spurs imitation, it 
may also spur differentiation. Firms and social movements want to keep up 
with their competitors—but they also want to outdo them and to keep their dis- 
tance. Thus Greve (1995, 1996) shows that radio stations do not imitate the 
strategic moves of stations in local markets (which would intensify competition). 
Instead, decisions are influenced by the behavior of sister stations in 
other markets and the behaviors that those sister stations come into contact 
with. Becker (1998) suggests that local congregations distance their programs 
and mission from other local congregations of the same denomination (with 
whom they most directly compete for adherents) while learning from congre- 
gations of other denominations. 


PRESTIGE While the above social relations are all symmetric, adopters may be 
influenced strongly by prestigious, central actors in ways that are not reciprocated. 
Both social psychological and structural mechanisms are involved: 
Lower-ranking community members aspire to be like prestigious others and find 
it useful to resemble powerful leaders, while adoptions by central actors shift 
community norms or interaction patterns sufficiently that others find it hard not 
to go along. 

For example, Fligstein (1990) argues that models of management diffuse 
from central firms to the larger business community as they prove their utility 
in responding to new politico-economic conditions. Haveman (1993) shows 
that deregulation led thrifts to follow large, financially profitable thrifts into 
new markets. And Han (1994) argues that mid-sized companies use the ac- 
counting firms that the largest firms in their industry employ, while large firms 
seek to differentiate themselves from each other.6 


SPATIAL PROXIMITY Perhaps the most common finding in diffusion research 
is that spatially proximate actors influence each other. No distinctive logic can 
be proposed—rather, spatial proximity facilitates all kinds of interaction and in- 
fluence. Where network relations are not mapped directly, proximity often 
provides the best summary of the likelihood of mutual awareness and interde- 
pendence. 

In some work, spatial proximity is measured by pairwise distances. Knoke 
(1982) shows effects of geographic proximity on the spread of municipal re- 
form. Hedstrom (1994) shows how the Swedish trade union movement ex- 
panded geographically. Petras & Zeitlin (1967) argue that radical ideology in 
Chile (measured by support for Allende) spread from mining communities to 
adjacent agricultural communities. And in a careful reanalysis of Spilerman's 
data using event history methods, Myers (1997) finds that the propensity to riot 
falls with distance from cities where riots have occurred. 

Other studies examine contagion within spatially defined regions that may 
possess both high levels of interaction and a common sense of identity. For ex- 
ample, Davis & Greve (1997) point to the diffusion of golden parachutes via 
local business communities, while Burns & Wholey (1993) locate regional influences 
on the adoption of matrix management.7 


CULTURAL CATEGORIES Finally, reference groups may be culturally con- 
structed around common status and purpose rather than as dense webs of inter- 
action. McAdam & Rucht (1993) point to the importance of cultural categories 
such as "activist" in promoting the spread of tactics where relational ties are 
thin. Chaves (1996) finds that the ordination of women was contagious within 
groups of denominations defined by shared theological orientations. And in a 


6Like all other communities, organizations and social movements display prestige orderings 
(see Schrum & Wuthnow 1988, Fombrun & Shanley 1990). 

7Tolnay et al (1996) find a surprising negative diffusion effect of geographic proximity on 
lynchings (and also exhaustion rather than contagion within counties). They argue that lynchings 
are a social control mechanism whose memory lingers in the local population. 
direct comparison of a variety of diffusion channels, Soule (1997) shows that 
shantytown protests diffused between similar kinds of campuses (for example, 
between research universities) rather than within regions. 

Culturally defined similarity may also inspire organizational arrangements 
that press for homogeneity. Strang & Chang (1993) show that the International 
Labor Organization has spurred the adoption and expansion of social security 
programs, particularly by the welfare laggards of the industrialized world 
(though the United States proved immune). Soule & Zylan (1997) find that 
AF(D)C reforms diffused within relevant administrative groupings rather than 
traditionally defined regions. 


CULTURAL BASES OF DIFFUSION 


Both theory and empirical work generally focus on the sorts of structural bases 
for diffusion catalogued above. But this is only part of the story. Structural op- 
portunities for meaningful contact cannot tell us what sorts of practices are 
likely to diffuse, and such opportunities may lead to conflict or boundary for- 
mation as well as to diffusion. 

An analysis of the cultural (in some usage, institutional) bases of diffusion 
speaks more directly to what spreads, replacing a theory of connections with a 
theory of connecting. We emphasize three lines of analysis: discussion of the 
interpretive work that catalyzes flow, inspection of the diffusion industries 
whose stock in trade is discourse, and examination of how empirical diffusion 
patterns are related to the cultural status of the diffusing item. 


Interpretive Work as Mediating Diffusion 


Cultural approaches emphasize that a self-consciously interpretive process un- 
derlies most adoption (though there is a place for unthinking mimicry and hys- 
terical contagion; see Kerckhoff & Back 1968). Strang & Meyer (1993) discuss 
how practices are theorized in terms of general models and causal relationships. 
Snow & Benford (1992) apply Goffman’s notion of a frame: an “interpretive 
schema that simplifies and condenses the ‘world out there’ by punctuating and 
encoding objects, situations, events, experiences, and sequences of action” (p. 
137). Lillrank (1995) portrays the interpretive process as one of translating con- 
crete practices into abstractions for export and then unpacking the abstraction 
into a (suitably modified) concrete practice upon arrival. Jointly, the argument 
is that practices diffuse as they are rendered salient, familiar, and compelling.8 

Strang’s (1997) inquiry into the American reception of quality circles ex- 
plores theorization via a content analysis of public discourse. Articles in the 


8Differences between these ideas have to do with the types of cultural materials viewed as most 
powerful (professional/scientific accounts vs cultural metaphors) and the patterns of diffusion 
anticipated (substantial homogeneity vs tailored differences). 
business literature are coded for the claims they make about quality circles. 
The Japanese practice is found to have been theorized under two different 
frames, a dominant human relations interpretation and an undertheorized 
problem-solving one. These public discourses help us understand how and 
why American companies experimented with quality circles. 

Snow (1993) examines framing in the importation of Nichiren Shoshu/Sokagakkai 
(NSS), a Japanese-based Buddhist movement, into the United States. 
He emphasizes that the incorporation of American cultural symbols by the 
NSS has facilitated the movement's expansion and viability. The NSS displays 
national symbols such as the American flag in its ceremonies, directs members 
to be winners (a decidedly non-Buddhist ideal), and peppers its communiqués 
with American archetypes such as the pioneering spirit and town meetings.9 

Perhaps the richest analysis of interpretation is Hirsch's (1986) discussion 
of the language associated with hostile takeovers. This imagery shifts dramati- 
cally over time, as initially stark portrayals of hostile takeovers as crimes com- 
mitted by outsiders are replaced by a more complex, richer imagery of 
shootouts, Big Hat Boys, rescues, and Snow Whites. Hirsch treats this lan- 
guage as a cultural phenomenon that evolves along with takeover behavior and 
its social location within the business community, initially framing resistance 
and later framing acceptance. 

In addition to generating interesting stories, attention to the interpretive 
work underlying diffusion has two main implications. It points out that prac- 
tices do not flow: Theorized models and careful framings do. And it argues that 
interpretive work selects and transforms diffusing practices: Not all practices 
can be theorized or framed, and none come out of the process unmodified. 


Fashion-Setting Communities 


Interpretive work promoting diffusion is accomplished by both sources and 
adopters; sometimes the source, sometimes the adopter, and sometimes both 
play an active role (Snow & Benford 1995). But cultural approaches to diffu- 
sion direct particular attention to the external communities whose members 
make their living promulgating innovation and commenting on change. These 
others (Meyer 1995) have access and influence largely to the extent that their 
interpretive frames are compelling to decision makers, and so here we see 
much attention to the cultural conditions for diffusion. 

Today, the management fashion industry is very big business. While the 
theorization and hyping of organizational action has always been fundamental 
to managing (Eccles & Nohria 1992), a strong trend toward the externalization 


9Similarly, the shantytown tactic may have diffused rapidly in the college divestment 
movement because it provided a clear and compelling frame for the conflict emphasizing the living 
conditions of South African blacks. There is little evidence that use of the tactic prompted 
university divestment (Soule 1998). 
of organizational analysis is apparent. The consultant, guru, and management 
scholar populations are on the rise, as are the output of the business press and 
the sales of business books (see Micklethwait & Wooldridge 1996). 

Researchers have begun to probe the content of the business fashion-setting 
business. Barley & Kunda (1992) argue that managerial discourse oscillates 
between rational and normative models of organizing. Periods dominated by a 
master narrative of rationalism facilitate the construction, dissemination, and 
contagiousness of practices such as systems analysis, time and motion studies, 
and reengineering. Periods marked by a narrative of normative integration en- 
hance the diffusion of human relations techniques and culture engineering. 

These rhetorical frames appear to be the product of both local conditions and 
the cultural materials available in even wider societal frames. Barley & Kunda 
(1992) suggest that the rational-normative opposition reflects a deep antinomy 
in Western culture that is regulated by temporal segregation. Shenhav (1995) 
links the rise of the Taylorist model to the professional mobility project of en- 
gineers, labor unrest, and the society-wide frame of Progressivism. And Abra- 
hamson (1997) finds that turnover and labor union activity help explain the 
postemergence prevalence of normative rhetorics such as the human relations 
movement. 

Collective discourses on narrower organizational practices also exhibit im- 
portant regularities (Abrahamson 1996, Abrahamson & Fairchild 1997). Inno- 
vations have observable latency periods before bursting onto the scene and re- 
place each other in quick succession. These dynamics seem to arise both from 
processes internal to the fashion industry and from exogenous drivers. Fashion 
setters must move on lest others catch up, and norms of progress mandate that 
old wine be placed in new bottles. Nor can fashions predicated on Japanese in- 
dustrial superiority easily withstand a crash on the Nikkei. 

Discursive frames also arise in the social movement arena. Gamson & 
Modigliani (1989) trace shifts in the discussions of nuclear power that en- 
abled or disabled various forms of protest. The various media also apply char- 
acteristic modes of inquiry and representation. For example, newspapers edi- 
torialize while television is guided by a particular conception of balanced re- 
porting where two sides of every issue are located and represented. Tarrow 
(1989) argues that the media's attention to the sensational produces spirals of 
more controversial action—an insight that might also be applied to organiza- 
tional innovation. 


The Cultural Status of the Diffusing Practice 


Practices that accord with cultural understandings of appropriate and effective 
action tend to diffuse more quickly than those that do not. Strang (1990) shows 
that decolonization spread rapidly because it resonated with increasingly sali- 
ent models of national community, popular sovereignty, and expanded partici- 
pation. Hirsch (1986) notes that the frequency of hostile takeovers increased as 
the practice was symbolically legitimated, and Tolbert & Zucker (1983) find 
that the pace of civil service reform accelerated after professional groups came 
to consensus on its virtues. 

Menzel (1960) organizes the results of much early diffusion research by ob- 
serving that centrally placed actors are early adopters of culturally legitimate 
innovations, whereas illegitimate innovations are adopted by "marginal men" 
unconstrained by community norms. Contemporary research suggests similar 
patterns. For example, Kraatz & Zajac (1996) find that poor, failing liberal arts 
colleges adopt professional programs inconsistent with their larger identity. 
Leblebici et al (1991) note that fringe players were the carriers of innovations 
that challenged and repeatedly transformed the institutional structure of radio. 
Stearns & Allan (1996) argue that peripheral firms set off merger waves by re- 
sponding quickly to changing political and economic conditions. 

Strang & Meyer (1993) suggest that the more successfully theorized a dif- 
fusing practice is, the less its diffusion will be relationally structured. The no- 
tion is that an easily communicated, strongly legitimated innovation requires 
less local promotion and mutual sense-making than a practice that is hard to 
understand and motivate.10 Davis & Greve (1997) make this point in a study of 
the diffusion of poison pill and golden parachute responses to the threat of hos- 
tile mergers. They find that the pill diffused rapidly via board interlocks, 
whereas parachutes spread slowly within local business communities. Davis & 
Greve argue that the public legitimacy of the poison pill permitted the rela- 
tively thin, information-carrying medium of corporate board contacts to chan- 
nel adoption, while the scandalous parachute required mutual reassurance 
within business communities. 

However, bandwagons are increasingly unlikely to form as illegitimacy 
rises in the eyes of adopters. For example, Kraatz & Zajac (1996) find no evi- 
dence of contagion in professional program adoption by liberal arts 
schools—colleges introducing these programs look more like defectors bow- 
ing to financial need than participants in a social movement for educational 
relevance. And Baker & Faulkner (1997) point to the extreme case of a real- 
estate swindle, whose perpetrators must minimize publicity and interaction. 


A WIDER COMPARATIVE LENS 


The most common design in diffusion research treats variability in the timing 
of adoption of a single practice across a single community (a relationally and 
culturally connected population). Almost all of the previously mentioned stud- 
ies are of this type. Much less work compares rates, patterns, and causal 
mechanisms across settings. We emphasize work that promotes a broader 
comparative analysis. 


10 In a convergent vein, Tarrow (1994) argues that modular forms of protest like the boycott and 
the mass petition supported more widespread action and faster diffusion because they could be 
flexibly utilized against different opponents and in service of different causes. 


Cycles of Protest and Innovation 


Diffusion processes may play a role in more complex webs of action and reac- 
tion. For social movements, the tendency of diffusion dynamics to spread and 
amplify protest is opposed by increasingly strong responses by the state. Pit- 
cher et al (1978) present an early formal model of the instigation and inhibition 
of collective violence as learning processes. Olzak (1992) models the dynamics 
of collective action as the combined result of contagion and exhaustion effects. 

Tarrow (1989, 1994) points to a larger set of dynamics producing protest 
cycles like the American civil rights-to-antiwar cycle of the 1960s. Cycles are 
periods of heightened conflict when new ideas are developed rapidly and dif- 
fuse across movement organizations that support, compete, and learn from one 
another.11 These cycles exhibit at least three kinds of diffusion: (a) Collective 
action spreads across space and sectors (class conflict might move from heavy 
to light industry). (b) New frames of meaning diffuse across as well as within 
movements (for example, the rubric of “rights” spread from the civil rights to 
the women's movement). And (c) novel tactics, such as the sit-in, are forged 
and diffuse within protest cycles. 

McAdam (1995) elaborates this model in a discussion of relationships be- 
tween initiator movements (such as Solidarity in Poland) and the spin-off 
movements that follow. Meyer & Whittier (1994) describe the strong influ- 
ence of the women's movement on the ideas, tactics, and organizational struc- 
ture of the 1980s peace movement. 

Business communities display parallel dynamics in cycles of technological 
and managerial innovation. For example, the 1980s and 1990s have been a hot- 
bed of efforts to transform organizations. Progressive firms such as Motorola, 
managerial consultants such as CSC Index, and gurus like Tom Peters are the 
carriers of a variety of strategies for enhancing quality, speeding innovation, 
downsizing, and empowering workers. These movements spread from firm to 
firm, often following a core-periphery pattern (from big manufacturing and 
high-tech to services to education and government). They compete but also 
learn from and build on each other, as opposing strategies such as TQM and 
reengineering become hard to distinguish in practice. 


11 Soule & Tarrow (1991) explore perhaps the first modern cycle of protest in the revolutions of 
1848. Both spatial patterns in the temporal incidence of collective action and qualitative evidence 
make it clear that protest was diffusing across countries (mobs in Germany carried French flags and 
sang French songs). The rate of diffusion in this era of slower mass communications is startling. 


Same Practice, Different Communities 


A tale from a Korean village (Rogers & Kincaid 1981) suggests the impor- 
tance of cultural context. Family planning in the village of Oryu Li faced 
strong resistance from husbands, who beat their wives if they tried it. It spread 
only after a mothers' club not only promoted contraception but restructured 
the distribution of power in the village. Led by the indefatigable Mrs. Choi, the 
club bought the local wineshop and fired its "chopstick girls," raised a pig, 
manufactured uniforms and sold them at a profit, and accumulated sufficient 
funds to buy much of the land surrounding the village. What would have oc- 
curred through contagious diffusion in a Midwestern town was in Oryu Li a 
saga of heroism, collective action, and changing gender roles. 

In less dramatic fashion, explicit comparisons of diffusion processes across 
national societies demonstrate the operation of structural factors that would 
otherwise be missed. Cole (1985, 1989) argues that the diffusion patterns of 
small group activities in three countries were molded by national infrastruc- 
tures for diffusion. In Japan and Sweden, central organizations bankrolled by 
industry promoted and oversaw the diffusion of best practice. The American 
business sector lacked such institutions, and instead business consultants oper- 
ated in a free-for-all market for innovation.12 Cole argues that the absence of a 
larger infrastructure led to tepid and faddish diffusion, where business consult- 
ants gained little access to top decision-makers and watered down their wares 
for mass promotion. 

Guillen (1994) examines the reception of several major schools of manage- 
ment across four countries. He focuses on the impact of national cultural dis- 
course, structures of state and occupational power, and business interests. For 
example, elite mentalities of modernism and a strong engineering profession 
hastened German use of scientific management techniques, while Spain's tra- 
ditional humanism, labor unrest, and weak engineering profession led Taylo- 
rism to wilt on the vine. 


Different Practices, Same Community 


Comparisons of different practices diffusing in a single population or the same 
practice diffusing in different communities often highlight how cultural under- 
standings shape adoption patterns. The work of Davis & Greve (1997) de- 
scribed above is very much in this line. Another example is Mizruchi & Fein's 
(1997) analysis of how authors have employed DiMaggio & Powell's (1983) 
concepts of coercive, mimetic, and normative mechanisms producing isomor- 
phism. They find the greatest reference to mimetic processes, and argue that 
this follows from the resonance of the idea of rational copying given the view 
that organizations are autonomous and are rational actors. 


12 One does see a state-sponsored infrastructure in the American health sector, with its 
experiments, subsidized models, and regional innovation-diffusion centers (Fennell & Warnecke 
1988). 

Rowan (1982) provides a more structural analysis of legitimation, arguing 
that innovations diffuse rapidly when core actors are in agreement and fizzle 
when they are not. For example, curriculum reform was adopted rapidly by 
school districts when the state legislature, the state educational agency, and the 
teachers' association supported the same model. School districts disregarded 
curricular innovations when this consensus fell apart (for example, when the 
legislature regarded new texts as too radical). 

Shifts in Causal Effects During Diffusion 

Finally, much research looks for shifts in causal processes as diffusion un- 
folds. The most influential such analysis is Tolbert & Zucker’s (1983) discus- 
sion of how local rationality is replaced by conformity to institutional models. 
They argue that civil service reforms diffused slowly in the nineteenth century 
in ways consistent with relevant city characteristics. After 1915, when civil 
service practices had become widely legitimated in professional circles, re- 
form diffused rapidly and indiscriminately. 

A related logic of crescive institutionalization appears in organizational 
studies that examine the changing effect of prior adoptions (rather than con- 
duct a separate discourse analysis). For example, Burns & Wholey (1993) find 
temporal decline in the effects of internal predictors and a growing effect of re- 
gional adoption in the diffusion of matrix management among hospitals. Bu- 
dros (1997) shows that the internal precipitants of corporate downsizing 
weaken over time while the overall bandwagon effect grows.13 

Much work on national educational and welfare policy finds similar dy- 
namics. Welfare policy adoption early in the twentieth century was tied to eco- 
nomic transformations and development, whereas after World War II policies 
were adopted rapidly everywhere (Collier & Messick 1975). Educational sys- 
tems were tied closely to national characteristics in the nineteenth century but 
spread in broadcast fashion in the twentieth century (Meyer et al 1992). 

Westphal et al (1997) extend this well-documented institutionalization 
model in analysis of TQM practices across hospitals. Breaking with standard 
practice, they examine the relationship between the timing of adoption and 
what gets adopted, contrasting conventional implementation of TQM models 
(measured as closeness to average use and to theoretical models) with customi- 
zation of TQM to local conditions. Early adopters are shown to customize 
while late adopters adopt conventional forms, and network ties to adopters en- 
courage customization early and conventionality late. They further show that 
conformity to TQM standards is positively related to hospital legitimacy but 
negatively related to efficiency. 


13 Coefficient values for contagion are rather stable across the three historical periods of 
downsizing that Budros studies. But since the covariate (prior downsizing efforts) is rising 
continuously, the total effect of prior adoptions increases over time. 


FORMAL MODELS AND ESTIMATION 


Interest in diffusion has stimulated much attention to models and methods that 
capture the interdependence in outcomes central to contagion. This work 
builds upon the larger movement toward the dynamic analysis of longitudinal 
data. We briefly note the range of approaches and research strategies charac- 
teristic of quantitative analysis of diffusion. 


Point-to-Point Processes 


Early modeling work in diffusion arose out of attempts to fit curves to cumula- 
tive adoption patterns. The key theoretical discovery was that contagion im- 
plied the commonly observed S-shaped cumulative adoption curve. A stan- 
dard mixed model combining both external and internal sources of diffusion 
(see Bartholomew 1982; Mahajan & Peterson 1985 for a review) gives 

Models of contagion have been pursued in two main directions. The first is 
to draw inferences about underlying mechanisms from the shape of the adop- 
tion curve. The classic example is Coleman et al’s (1966) demonstration of dif- 
fering temporal patterns of adoption for socially integrated and isolated doc- 
tors. Hernes (1972) and Diekmann (1989) find that marital rates resemble a 
diffusion process marked by increasing ardor but declining suitability. Yama- 
guchi (1994) shows that Hernes-type models provide a good fit to simple diffu- 
sion processes across simulated networks. 

The more common strategy, however, is to model empirical diffusion pro- 
cesses at the individual level, writing event history formulations of Equation 1 
that incorporate hypothesized interdependencies between adopters (Strang 
1991, Morris 1993). For example, Davis (1992) analyzes the transmission of 
poison pill strategies by counting board interlocks with prior adopters; Zhou 
(1993) examines the diffusion of occupational licensing by counting the 
number of states with laws in place. 

Strang & Tuma (1993) formalize and extend this strategy, proposing a het- 
erogeneous diffusion framework that models the hazard as 


r_n(t) = \exp\Big[ \alpha' x_n + \sum_{s \in S(t)} \big( \beta' v_n + \gamma' w_s + \delta' z_{ns} \big) \Big]    2. 

for the multiplicative case and a related form for additive effects of contagion. 
Here S(t) is the set of prior adopters at time t. This framework permits direct 
examination of intrinsic propensities to adopt (x_n), generalized susceptibility 
to influence (v_n), the infectiousness of prior adopters (w_s), and social 
proximity (z_ns). Models can be estimated via SAS macros (Strang 1995) or RATE 
(Tuma 1994). Simulation work (Greve et al 1995) indicates that heterogeneous 
diffusion models can be estimated robustly with complete data on populations 
and have some application when data are incomplete. 
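
As a rough illustration of the multiplicative form, the sketch below evaluates a hazard for one potential adopter. It assumes the four components named above enter as an intrinsic-propensity term plus, for each prior adopter, a susceptibility term, an infectiousness term, and a proximity term. All names and coefficient values are hypothetical, and this shows the arithmetic only, not the estimation routines cited.

```python
import math


def dot(coefs, values):
    """Inner product of a coefficient vector and a covariate vector."""
    return sum(c * v for c, v in zip(coefs, values))


def hazard(alpha, x_n, beta, v_n, gamma, delta, prior_adopters):
    """Multiplicative hazard of adoption for actor n.

    prior_adopters is a list of (w_s, z_ns) pairs, one per earlier adopter s:
    w_s describes that adopter's infectiousness, z_ns its proximity to n.
    """
    linear = dot(alpha, x_n)  # intrinsic propensity to adopt
    for w_s, z_ns in prior_adopters:
        # Each prior adopter adds susceptibility, infectiousness, and proximity terms.
        linear += dot(beta, v_n) + dot(gamma, w_s) + dot(delta, z_ns)
    return math.exp(linear)


# Hypothetical values chosen only for illustration.
baseline = hazard([0.5], [1.0], [0.1], [1.0], [0.2], [0.3], [])
exposed = hazard([0.5], [1.0], [0.1], [1.0], [0.2], [0.3], [([1.0], [1.0])])
print("hazard ratio from one prior adopter:", exposed / baseline)
```

Because the terms sit inside the exponential, each additional prior adopter multiplies the hazard rather than adding to it, which is what distinguishes this case from the additive variant.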

Spatial regression models (Doreian 1981, Marsden & Friedkin 1993) form 
a parallel strategy for estimating the effects of interdependence where out- 
comes are continuous—for example, if we studied the extensiveness of down- 
sizing or the size of demonstrations. Models take the form 


y = \rho W y + X\beta + \varepsilon    3. 


where W represents the hypothesized structure of interdependence. While full 
information methods are unwieldy, Anselin (1988) and Land & Deane (1992) 
present estimation techniques that shortcut these problems and make spatial 
regression modeling widely accessible. 
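
To make the interdependence in Equation 3 concrete: because the outcome appears on both sides, each unit's outcome depends on its neighbors' outcomes, which depend in turn on their own neighbors. The sketch below computes the implied outcomes by repeated substitution, which converges when W is row-normalized and the interdependence parameter is below one in absolute value. The weight matrix, the parameter value, and the exogenous component are hypothetical, and this illustrates the model's structure rather than any of the estimation techniques cited.

```python
def solve_spatial_lag(rho, weights, exogenous, iterations=200):
    """Iterate y <- rho * W y + c until it settles, where c stands for X*beta + error."""
    y = list(exogenous)
    for _ in range(iterations):
        y = [
            rho * sum(w_ij * y_j for w_ij, y_j in zip(row, y)) + c_i
            for row, c_i in zip(weights, exogenous)
        ]
    return y


# Three units, each influenced equally by the other two (rows sum to one).
W = [
    [0.0, 0.5, 0.5],
    [0.5, 0.0, 0.5],
    [0.5, 0.5, 0.0],
]
c = [1.0, 2.0, 3.0]  # hypothetical exogenous component for each unit
y = solve_spatial_lag(rho=0.4, weights=W, exogenous=c)
print([round(value, 3) for value in y])
```

With positive interdependence every outcome ends up above its exogenous component, since each unit is pulled upward by the others.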

Few methods are available for recovering network structures of influence 
from data, as opposed to the hypothesis tests that heterogeneous diffusion and 
spatial regression models permit. Mantel (1967) develops a general permuta- 
tion test for spatiotemporal clustering. This approach can be used to investi- 
gate network effects with a very general autocorrelation structure (Krackhardt 
1988), though temporal ordering is sacrificed. Strang (1996) suggests the 
study of multiple adoption processes to identify network influence structures. 
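
A minimal sketch of a Mantel-style permutation test, under simplifying assumptions: the statistic is the cross-product of a spatial distance matrix and a temporal distance matrix, and its reference distribution is built by relabeling the units in one matrix. The data (eight units on a line that adopt in spatial order) are hypothetical. Because only the absolute gap between adoption times enters the statistic, information about who adopted first is not used.

```python
import random


def mantel_statistic(a, b):
    """Sum of element-wise products over all off-diagonal pairs."""
    n = len(a)
    return sum(a[i][j] * b[i][j] for i in range(n) for j in range(n) if i != j)


def mantel_test(a, b, permutations=999, seed=0):
    """Return the observed statistic and a one-sided permutation p-value."""
    rng = random.Random(seed)
    n = len(a)
    observed = mantel_statistic(a, b)
    hits = 0
    order = list(range(n))
    for _ in range(permutations):
        rng.shuffle(order)
        # Relabel rows and columns of b together so its internal structure is kept.
        permuted = [[b[order[i]][order[j]] for j in range(n)] for i in range(n)]
        if mantel_statistic(a, permuted) >= observed:
            hits += 1
    return observed, (hits + 1) / (permutations + 1)


# Hypothetical data: units at positions 0..7 adopt in the same order.
positions = list(range(8))
times = list(range(8))
space = [[abs(p - q) for q in positions] for p in positions]
time_gap = [[abs(s - t) for t in times] for s in times]

observed, p_value = mantel_test(space, time_gap)
print("statistic:", observed, "p-value:", p_value)
```

A small p-value here indicates that units close in space also tend to be close in adoption time, which is the clustering the test is designed to detect.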


Threshold Processes 


Models of threshold processes break with the notion of direct contagion to 
view potential adopters as responsive to the distribution of present adopters in 
the population (Granovetter 1978, Schelling 1978). For example, it seems 
plausible that white flight from cities is based on response to racial proportions 
rather than to direct encounters. Granovetter (1978) emphasizes the nonlinear 
dynamics produced by variation in individual thresholds, and Valente (1995) 
proposes local thresholds for reference groups based on direct network ties. 
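
The nonlinearity can be seen in a minimal sketch, assuming each actor adopts once the number of current adopters reaches a personal threshold. The first population follows the illustration associated with Granovetter (1978), with thresholds running from 0 to 99; the second changes a single threshold. Function and variable names are hypothetical.

```python
def equilibrium_adopters(thresholds):
    """Count adopters once no further actor's threshold is met.

    Each threshold is the number of adopters an actor must observe before adopting.
    """
    adopters = 0
    while True:
        next_adopters = sum(1 for t in thresholds if t <= adopters)
        if next_adopters == adopters:
            return adopters
        adopters = next_adopters


# One actor at every threshold from 0 to 99: each adoption triggers the next.
uniform = list(range(100))

# Same population, except the actor with threshold 1 now requires 2.
perturbed = list(uniform)
perturbed[1] = 2

print(equilibrium_adopters(uniform))    # 100
print(equilibrium_adopters(perturbed))  # 1
```

The two populations have almost identical threshold distributions, yet one ends in universal adoption and the other stalls after the first adopter, which is why aggregate outcomes are a poor guide to underlying preferences in these models.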

But thresholds have been difficult to establish empirically, with more use of 
revealed thresholds to describe adoption patterns (see Granovetter & Soong 
1988, pp. 99-102; Valente 1995) than application of threshold models to pre- 
dict behavior. Threshold processes are hard to identify if we need to locate 
both the reference group and the threshold. In the only explicit effort to locate 
thresholds of which we are aware, analysis of 85 policies diffusing across the 
United States provided no evidence of regional or national thresholds in state 
policy adoption (Strang 1996). 

But other evidence does suggest that adopters often respond to combina- 
tions of signals. For example, Hagerstrand (1967) found that the spatial pattern 
of rural diffusion resembled that generated by simulations where two contacts 
with prior adopters led to adoption (simulations based on single contacts pro- 
duced greater spatial scatter than was observed in empirical maps). And Asch 
(1951) demonstrated that nearly total opposition was required to induce most 
subjects to disbelieve their own eyes. 


PIOUS HOPES FOR FUTURE RESEARCH 


Design 

While single-population, single-practice research designs will no doubt con- 
tinue to dominate the diffusion literature, theoretical development would 
benefit from a larger comparative lens. Considerable insight has been devel- 
oped on a case-by-case basis into the mechanisms behind the diffusion of a va- 
riety of important and interesting social practices. But insights are unlikely to 
be integrated, or analysts spurred to theorize more aggressively, without the 
challenges posed by comparative research. 

Direct contrasts of diffusing practices can provide more nuanced views of 
the mechanisms involved, as the work of Davis & Greve (1997) illustrates. In- 
dependent studies of poison pills and golden parachutes would likely have as- 
serted incompatible claims about the types of relational structures that underlie 
diffusion. Their joint analysis led to a deeper argument about how cultural 
meanings affect the strength of alternative diffusion mechanisms. 

More attention to how innovations compete and support each other is also 
needed. In the social movement arena, many students of collective action are 
beginning to question the movement-centric focus that case studies reinforce. 
Attention to how tactics, strategies, symbols, and frames diffuse across move- 
ments produces a richer picture well worth the research investment. And stud- 
ies of organizational diffusion would do well to place mutually evolving inno- 
vations in relation to each other rather than analyze them seriatim. 

Finally, we call for examination of practices that fail to diffuse. There is a 
strong selection bias in diffusion research, where investigators choose ulti- 
mately popular practices as appropriate candidates for study. Investigation of 
practices that few adopt would provide a more balanced picture. 

Study of practices that fail to diffuse would also shed light on those that do. 
For example, we noted above that the rapid diffusion of the shantytown tactic 
in the divestment movement may have flowed from its iconographic immedi- 
acy and symbolic power. Comparison to concrete tactics that were attempted 
but didn’t diffuse (campus sleep-ins, for example) could examine this proposi- 
tion, along with arguments about other attributes of tactics relevant to collec- 
tive action (how they are repressed, how they build activist solidarity, how 
they appear on television, and whether they lead to desired results). 


Substance 


Relational analysis has been the backbone of diffusion research in sociology. 
But ideas based on interpersonal relations translate unclearly into situations 
where collective actors such as organizations are the adopters. The tendency to 
refer to the effect of any direct tie as cohesion is symptomatic (particularly 
since the ties under discussion often seem so weak). More important, the 
elaborate analyses of diffusion and diffusion-like dynamics mounted at the in- 
dividual level (work such as that of Burt, Carley, Doreian, Friedkin, Macy, and 
Marsden) do far more with the network metaphor than analyses of collective 
actors seem able to pull off. 

The problem is that collective actor parallels to face-to-face interaction are 
not as vivid or meaningful as the real thing. Valuable insights into diffusion tra- 
jectories have been garnered by analysis of interlocking directorates, geographic 
proximity, and culturally analyzed similarities as diffusion channels. But there is 
a need for close attention to what sort of information and influence flows 
through these channels. And it would be useful to develop models of interorgani- 
zational structure less colored by an analogy to direct interpersonal interaction. 

Finally, the fashion setters who construct and disseminate new practices de- 
serve renewed attention. Diffusion dynamics seem increasingly volatile, and 
diffusing practices increasingly constructed, as interpretive work is external- 
ized in public discourse. Study of the media, consultants, and professional 
communities permits attention to cultural work and forms of agency that 
adopter-centric research overlooks. The impact of vibrant diffusion industries 
on the political and the business scene has hardly begun to be tapped. 


ABSTRACT 


A review of the scientific literature on the relationship between alcohol and 
violence and that between drugs and violence is presented. A review and 
analysis of three major theoretical approaches to understanding these rela- 
tionships are also presented. A number of conclusions are reached on the ba- 
sis of these efforts. First, despite a number of published statements to the 
contrary, we find no significant evidence suggesting that drug use is associ- 
ated with violence. Second, there is substantial evidence to suggest that alco- 
hol use is significantly associated with violence of all kinds. Third, recent 
theoretical efforts reviewed here have, despite shortcomings, led to signifi- 
cant new understanding of how and why alcohol and drugs are related to vio- 
lence. Fourth, these theoretical models and a growing number of empirical 
studies demonstrate the importance of social context for understanding vio- 
lence and the ways in which alcohol and drugs are related to violence. Fifth, 
the shortcomings of these theoretical models and the lack of definitive em- 
pirical tests of these perspectives point to the major directions where future 
research on the relationship between alcohol and violence, and between 
drugs and violence, is needed. 


INTRODUCTION 


That the United States leads the industrialized nations in rates of interpersonal 
violence is a well-documented fact (National Research Council 1993). Exam- 
ples of this can be seen in the extraordinarily high rates of violent crimes such 
as homicide, robbery, and rape in the United States (National Research Coun- 
cil 1993, Parker & Rebhun 1995); an additional and disturbing fact that has 
come to light in recent years is the increasing rate of youth violence, particu- 
larly lethal violence (Blumstein 1995; Alaniz et al 1998). 

During the last decade, interest has grown in the relationship between alco- 
hol, drugs, and violence. In addition to the mostly misguided attention in mass 
media and in political circles to the relationship between illegal drugs and vio- 
lence, a number of empirical studies have attempted to disentangle the associa- 
tions between alcohol, drugs, and violence. Several studies have attempted to 
organize this knowledge into a comprehensive theoretical framework. This 
chapter synthesizes this body of work to assess the state of the art in thinking 
about the relationships between psychoactive substances and violent behavior. 

Defining and understanding the complex relationships among alcohol, 
drugs, and violence require that we examine issues of pharmacology, settings, 
and larger social contexts to understand the mechanisms that associate sub- 
stance use and violence in individuals. In addition to this, we must also consid- 
er not only the ways in which individuals are nested within larger social con- 
texts, but also the ways in which these contexts themselves may create condi- 
tions in which violent behavior takes place, for example, the ways in which 
availability of substances, while itself conditioned to some degree by larger so- 
cial forces, contributes to the spatial distribution of crime and violence. 

We do not attempt to review the growing literature on the biological aspects 
of violence. Despite increased interest in this area of research, no credible scien- 
tific evidence currently exists that demonstrates any significant link between 
biological characteristics and violence (National Research Council 1993). Fu- 
ture research may reveal complex interactions among biological, pharmacol- 
ogical, psychological, and contextual aspects of alcohol- and drug-related vio- 
lence, but no conclusive evidence exists to support this idea at present. 

In addition to trying to understand the ways in which alcohol and drug use 
may contribute to violent behavior, it is also important to consider the ways 
that alcohol and other drugs relate to human behavior in general. Some ad- 
vances have been made in the study of psychological expectancies concerning 
alcohol's effect on behavior (Brown 1993, Grube et al 1994), the relationship 
between alcohol and cognitive functioning (Pihl et al 1993), the impact of al- 
cohol on aggressive behavior (Leonard & Taylor 1983), and the dynamic de- 
velopmental effects of early exposure to alcohol and violence among young 
people (White et al 1993) and among women who have been victimized as 
children and as adults (Miller & Downs 1993, Widom & Ames 1994, Roesler 
& Dafler 1993). 

Similar work has attempted to understand the links between illicit drugs and 
behavior, although due to the attention focused on the illegality of these sub- 
stances, this body of work tends to be most concerned with illegal behaviors 
that might be associated with drugs. Examples from this literature include ex- 
aminations of the links between drug use and delinquent behavior among juve- 
niles (Watts & Wright 1990, Fagan 1993, Fagan et al 1990); relationships be- 
tween substance use and domestic violence (Bennett 1995, Bennett et al 1994, 
Roberts 1987, Blount et al 1994); the ways in which the use and distribution of 
illicit drugs are related to all types of crime, particularly nonviolent property 
offenses (Ball et al 1982, Ball 1991, Baumer 1994, Greenberg 1976, Johnson 
et al 1994, Klein & Maxson 1985, McCoy et al 1995, Mieczkowski 1994, 
Feucht & Kyle 1996); and the impact of drug use on the ability to maintain in- 
terpersonal relationships (Joe 1996, Fishbein 1996, Lerner & Burns 1978). 

A fairly common problem specific to theoretical and empirical investiga- 
tions of the relationship between drugs and violence is the tendency—largely 
ideological—to lump all illicit drugs together, as if all drugs might be expected 
to have the same relationship to violent behavior. Different drugs certainly do 
have different pharmacological effects, which may or may not influence the 
user's tendency toward violence; this should be treated as a prominent empiri- 
cal question, rather than as an afterthought usually addressed only when results 
are disaggregated by drug type. Another problem specific to the analysis of the 
impacts of illicit drugs on behavior that hinders our understanding of the rela- 
tionship between drugs and violence in real-world (as opposed to laboratory) 
settings was cogently pointed out by one researcher—that the degree of both 
impurity and deception in the illicit drug market “makes any direct inferences 
between drug-taking and behavior seem almost ludicrous” (Greenberg 1976, 
p. 119; see also Johnson 1978). Evidence of the greater likelihood of polydrug 
use among more violent research subjects also confuses any causal inferences 
that can be made with respect to particular drugs (e.g. Spunt et al 1995, Inciardi 
& Pottieger 1994). 


DRUGS, ALCOHOL, AND VIOLENCE AT THE 
INDIVIDUAL LEVEL 


A rather fragmented research literature attempts to identify links between alco- 
hol, drugs, and violence at the individual or pharmacological level. This work 
is discussed briefly below, mainly as a prelude to theoretical models developed 
in light of these empirical findings. 

Evidence of an individual level association between alcohol and violence is 
widespread. For example, Collins (1981) reviewed a number of studies in 
which alcohol and violence were associated among individuals. Experimental 
studies have also shown a consistent relationship at the individual level be- 
tween alcohol use and aggressive behavior, especially in the presence of social 
cues that would normally elicit an aggressive response; the consumption of al- 
cohol increases the aggressiveness of this response (Taylor 1983, Gantner & 


294 PARKER & AUERHAHN 


Taylor 1992, Pihl et al 1993). Roizen (1993, pp. 4-5) reports that in nearly 40 
studies of violent offenders, and an equal number of studies of victims of vio- 
lence, alcohol involvement was found in about 50% of the events and people 
examined. Although most individual-level studies assume that alcohol has a 
potentially causal role, an argument supported by the experimental studies 
cited here, some have argued variously that the relationship is spurious (Col- 
lins 1989), that both are caused by third factors (Jessor & Jessor 1977), or that 
aggression and violence precede alcohol and drug abuse (White et al 1987). 

In general, little evidence suggests that illicit drugs are uniquely associated 
with the occurrence of violent crime. While respondents of the 1991 National 
Criminal Victimization Survey perceived more than one fourth of violent 
criminal assailants to be under the influence of alcohol, less than 10% of these 
assailants were reported by victims to be under the influence of illicit drugs. Of 
these, more than half were reported to be under the influence of both alcohol and 
drugs (Bureau of Justice Statistics 1992a). These percentages are supported by 
urinalysis data for persons arrested for violent offenses, which yield the find- 
ing that in 1990, only 5.6% of violent offenders were under the influence of il- 
licit drugs at the time of their offense (US Bureau of Justice Statistics 1992b). 

Studies of the drug and alcohol involvement of homicide offenders and vic- 
tims also support the notion that alcohol is, overwhelmingly, the substance 
most frequently implicated in this particular form of violence (Abel 1987, 
Spunt et al 1994, 1995, Wieczorek et al 1990, Yarvis 1994, Fendrich et al 
1995, Goldstein et al 1992). Interview studies with homicide offenders as well 
as toxicology studies of homicide victims consistently report that approxi- 
mately half of all homicide offenders are intoxicated on drugs or alcohol at the 
time of the crime; similar percentages of homicide victims test positive for 
substance use as well (Abel 1987, Langevin et al 1982, Ray & Simons 1987, 
Fendrich et al 1995, Spunt et al 1994, 1995, Wieczorek et al 1990, Kratcoski 
1990, Welte & Abel 1989, Garriott 1993, Tardiff et al 1995). Some evidence 
suggests that alcohol is the substance most frequently implicated in other vio- 
lent events as well (Buss et al 1995, US Bureau of Justice Statistics 1992a).1 


1Difficulties inherent in trying to assess the involvement of alcohol relative to other drugs in
violent events are largely the result of the way in which the research agenda surrounding the
relationship between drugs, alcohol, and violence has been constructed. The majority of data
collection efforts seem to be focused either on one particular substance (e.g. cocaine) and its
relationship to or involvement in violent episodes or on comparisons between alcohol and illicit
drugs in general, thereby hindering comparisons not only between alcohol and other drugs, but
between different illicit drugs as well. A recent example is the National Institute of Justice report
entitled Drugs, Alcohol, and Domestic Violence in Memphis (1997), which details research
conducted to determine the role of substance use in incidents of domestic violence. At no point in
the report are alcohol and drug use separated into distinct phenomena, making it impossible to
determine what substances may be associated with domestic violence.

A shortcoming common to much of the work that has attempted to disentangle
the individual-level relationships between drugs, alcohol, and violence is
that many researchers fail to make a theoretical and/or empirical distinction 
between different types of drugs. For this reason, a short review of the litera- 
ture concerning the links between violence and specific types of illicit drugs is 
presented below in the hope that some general conclusions can be drawn about 
the nature and magnitude of the relationship between illicit drugs and violence.


Heroin 


Evidence to support a link between heroin and violence is virtually nonexist- 
ent. While there is some evidence that heroin users participate in economically 
motivated property crimes (see Kaplan 1983, pp. 51-58 for a thoughtful and
critical discussion of this issue), the work of Ball and his colleagues (Ball et al 
1982, Ball 1991) fails to uncover persuasive evidence for a link between her- 
oin use and violent crime. Although no specific measures for violent crime are 
reported in the analysis of self-reported criminality (validated by official records)
from a sample of 243 heroin addicts in Baltimore, only 3% of the sample
reported committing, on a daily basis, any crime other than theft; the figures
for the weekly and “infrequent” commission of crimes other than theft are 3%
and 9%, respectively (Ball et al 1982). A later, more comprehensive analysis
undertaken to determine whether or not “common forces attributable to heroin 
addiction are of primary etiological importance with respect to crime" (Ball 
1991, p. 413) compares addict samples from three major Eastern cities. Echo- 
ing the results of the 1982 study, involvement in violent crime was negligible, 
accounting for between 1.5% and 5.6% of all addict criminality across cities 
(Ball 1991, p. 419). 


Amphetamines 


Considerable investigation has been made into a possible pharmacological 
link between amphetamines and violence. Some evidence indicates that in rare 
cases, either sustained periods of heavy use or extremely high acute doses can 
induce what has variously been called “toxic psychosis” or “amphetamine- 
induced psychosis,” a reaction that is virtually indistinguishable from schizo- 
phrenia (Ellinwood 1971, Fukushima 1994). Aside from these extremely rare 
cases, some evidence may speak to a link between violent behavior and am- 
phetamine use in ethnographic samples (Joe 1996) and in case-study research 
(Ellinwood 1971). One researcher notes, however, that this link may result 
from situational influences: “several...subjects seem to have lost intellectual 
awareness because they lived alone and had little chance to cross-check their 
delusional thinking. A long-term solitary lifestyle seems particularly signifi- 
cant in fostering this effect” (Ellinwood 1971, p. 1173). 

The importance of context and situation for the association between amphetamine
use and violent behavior is supported by animal studies as well;
Miczek & Tidey (1989) report that the social relationship between experimental
animals significantly influences the level and type of violent behavior that
they manifest when on amphetamines (Miczek & Tidey 1989, p. 75). Addi- 
tionally, the baseline rate of violent or aggressive behavior prior to ampheta- 
mine administration was an important predictor of violent behavior after drug 
administration. The authors conclude from this review of animal studies that: 


Among the most important determinants of amphetamine effects on aggressive
and defensive responses are the stimulus situation, species, prior experience
with these types of behavior, and...dosage and chronicity of drug exposure.
(Miczek & Tidey 1989, p. 71)


Cocaine 


Some evidence suggests that cocaine use and violent behavior may be associ- 
ated (Miller et al 1991, Budd 1989, Inciardi & Pottieger 1994); one of the most 
widely reported pharmacological effects of cocaine in users is feelings of para- 
noia (Goode 1993, Miller et al 1991). At least one group of researchers suggest 
that cocaine-associated violence “may in part be a defensive reaction to irra- 
tional fear” (Miller et al 1991, p. 1084). 

The route of administration may influence the likelihood of violent behav- 
ior in users, with methods delivering the most intense and immediate effects 
being most closely associated with some forms of violent behavior. Users who 
smoked the drug in the form of “crack” were most likely to engage in violence 
proximate to cocaine use, followed by users taking the drug intravenously. Us- 
ers who “snorted” the drug were found to be least likely to engage in violence 
(Giannini et al 1993). However, these researchers also reported that forms of 
violence “requiring sustained activity” (defined by the authors to include such
acts as rape and robbery) were not associated with route of administration of 
cocaine. Because of this, the authors conclude that “circumstance and situation 
may be as important as route of administration" (Giannini et al 1993, p. 69). 

The greater influence of social rather than pharmacological factors on the 
cocaine-violence relationship has also been reported elsewhere. Goldstein et al 
(1991) found that the relationship of violence to volume of cocaine use varied 
according to gender, with only male “big users" of cocaine contributing dis- 
proportionately to the distribution of violent events reported by the sample as a 


2Miller et al (1991) failed to find any such relationship between route of administration and
violence; however, the authors point out that this lack of finding may be explained by the use of a
treatment sample of users who were likely using cocaine in such high dosage and frequency as to
blur any distinction between acute toxicity effects specific to route of administration (Miller et al
1991, p. 1084).
whole (Goldstein et al 1991, p. 354). Additional evidence for the importance of
context can be found in ethnographic research, which reports that a great deal 
of violent behavior experienced by crack-using women arises as a result of 
their involvement in prostitution, which is related circumstantially, although 
not pharmacologically, to their drug use (Mieczkowski 1994, Johnson et al 
1994). 

An issue of research design has emerged in the extensive literature sur- 
rounding cocaine use and violence. Chitwood & Morningstar (1985) report 
systematic differences between samples of cocaine users in and out of treatment
programs, with samples from those in treatment characterized by greater
cocaine use in both frequency and volume. This difference has been reported 
elsewhere (e.g. Miller et al 1991); Inciardi & Pottieger (1994) also report that a 
comparison of cocaine users in treatment to users not in treatment reveals that 
treatment users were substantially more likely to be polydrug users and to en- 
gage in violence. These findings are important in that the type of sample used 
may, at least in the case of cocaine, greatly influence the findings about a drug- 
violence association. 


Phencyclidine 


Phencyclidine (PCP) is widely believed to be associated with violence; this 
conclusion is based almost exclusively on case study research, often of indi- 
viduals with psychiatric disturbances (e.g. Lerner & Burns 1978, McCarron et 
al 1981). Ketamine, a drug pharmacologically quite similar to PCP, has enjoyed
increasing popularity in recent years (Dotson et al 1995). PCP and Ketamine
are classified as “dissociative anaesthetics” because they diminish
awareness not only of pain but also of the environment in general. Delusions,
paranoia, and (in rare cases) psychosis are among the most commonly reported
effects of these drugs by users and clinicians (Marwah & Pitts 1986, Lerner & 
Burns 1978, McCarron 1986, Dotson et al 1995). However, one researcher 
concludes that "emotionally stable people under the influence of PCP probably 
will not act in a way very different from their normal behavior" (Siegel 1978, 
p. 285). 

Official crime statistics fail to show conclusive evidence for a unique link 
between PCP use and violent crime; arrestees who were not under the influ- 
ence of illicit drugs (according to urinalysis) were more likely to be charged 
with assault than were persons testing positive for PCP (Wish 1986). Among 
PCP-positive arrestees, the conditional distribution of offenses is influenced 
toward a greater likelihood of robbery charges, but Wish (1986) notes that this 
may be an artifact of demographic coincidence; PCP users tend to be younger 
than the average user of illicit drugs and thus coincide with the age group that 
dominates robbery arrests (Wish 1986, Maguire et al 1993). 


Summary


This review of the evidence concerning the relationship between the use of 
various illicit drugs and violence makes it clear that support for such linkages 
is absent. At best, we can characterize the available results as inconclusive. 
The strongest evidence is for a link between cocaine use and violence; how- 
ever, the conclusions of researchers whose findings support this idea univer- 
sally highlight a social rather than a pharmacological basis for this link. At 
present, no compelling evidence exists to support an association between vio- 
lence and amphetamines, Phencyclidine/Ketamine, or heroin. While there is 
some evidence that some of these drugs may induce psychosis, this reaction is 
exceedingly rare; virtually all research on this phenomenon consists of case 
studies, making it impossible to even estimate the frequency of such reactions 
in the population. 

The most extensive research literature concerning drugs and violence is that 
of investigations of the relationship between cocaine use and violence. A 
search through Sociological Abstracts reveals that this literature has grown 
concurrently with concern about, if not use of, cocaine (see White House Of- 
fice of National Drug Control Policy 1997 for use statistics). Between 1970 
and 1980, only four articles with “cocaine” or "crack" in the title are indexed, 
while between 1980 and 1990 there are approximately 75; in the 1990s, this 
figure is at nearly 200 before the decade's end. However, even in the face of
this profusion of research interest, we are still unable to say with any certainty 
that cocaine use and violent behavior are related. In part this may be attribut- 
able to the limitations inherent in ideologically driven research (e.g. Inciardi & 
Pottieger 1994); it may also indicate that such a link really does not exist, and 
that any amount of looking will continue to fail to uncover it. At this point in 
the state of our knowledge, it is clear that we must look beyond the level of the 
individual user in order to adequately understand and characterize the relation- 
ship (if any) between illicit drugs and violence. 


THEORETICAL APPROACHES 


We have identified four recent attempts to specify and/or explain the linkages 
among drugs, alcohol, and violence that are worthy of discussion, either for the 
fact of their prominence in the research literature or for the promise of greater 
understanding that they afford. Three of these four approaches have associated 
with them at least some empirical tests of the theories; these are discussed 
along with the explication of the theories. Each is discussed in turn, with attention
then passing to the commonalities between these theories, to determine
whether a useful synthesis can be made. 


Fagan's Approach: Intoxication, Aggression, and the
Functionality of Violence 


Jeffrey Fagan has produced several attempts to formulate a comprehensive 
theory of the relationship between the use of psychoactive substances, vio- 
lence, and aggression (Fagan 1990, 1993, Fagan et al 1990). In addition, he has 
also been part of a joint effort to further our understanding of youth violence in 
general (Fagan & Wilkinson 1998); this work is discussed here briefly vis-à- 
vis its complementarity with Fagan's formulations of the relationship of alco- 
hol, drugs, and violence. 

Above all, Fagan and his colleagues argue for the use of hierarchical or 
"nested contexts" models if we are to gain any understanding of the etiology of 
violence in general and of the relationships between substance use and vio- 
lence (Fagan 1993, 1990). In his most recent work Fagan has argued for a 
"situated transactions" framework as the most promising way to understand 
youth violence (Luckenbill 1977). 

In assessing the relationships between alcohol, drugs, and violence, Fagan 
(1990) has reviewed research and theoretical arguments from biological and 
physiological research, psychopharmacological studies, psychological and 
psychiatric approaches, and social and cultural perspectives in an attempt to 
present a comprehensive model of this relationship. He argues that the most 
important areas of consensus from these different perspectives are that intoxi- 
cation has a significant impact on cognitive abilities and functioning, and that 
the nature of this impact varies according to the substance used but is, in the 
last instance, moderated by the context in which behavior takes place. For ex- 
ample, social and cultural meanings of how people function under the influ- 
ence of alcohol, understandings about the impact of intoxication on judgment, 
the ability to perceive social cues, and the ability to focus on long- as well as 
short-term outcomes and desires are all extremely important factors in deter- 
mining the outcome of a social situation in which drugs or alcohol are present 
and whether that situation will result in violence. The nature of the setting in 
which interaction takes place and the absence or presence of formal and informal
means of social control are also important factors whereby intoxication influences
aggression. Fagan also posits that intoxicated individuals tend to have
limited response sets in situations of social interaction (1990, pp. 299-300);
Fagan & Wilkinson (1998) extend this view to a general analysis of the etiol- 
ogy of youth violence. 

To date, no empirical tests of this model exist. Fagan's approach leads to a 
very general theoretical model that would require substantial revision to per- 
mit empirical testing. For example, the outcome measure, aggression, is hardly 
the same thing as violence, although there is certainly some relationship be- 
tween these concepts. Further theoretical explanation is needed to establish the 
transition from aggression to violence, as well as the linkages between the antecedents
of aggression and aggression itself.

Fagan & Wilkinson propose a general model of youth violence that is relevant
to this discussion. They propose that youth violence is “a functional, purposive
behavior that serves definable goals within specific social contexts”
(Fagan & Wilkinson 1998, p. 2). Fagan & Wilkinson argue that one of the most 
important benefits that accrue to youth from the use of violence is the attain- 
ment of status, something to which youths have limited access. The social 
world in which adolescents operate places an increasingly high premium on 
status and reputation; broader contextual influences such as technology (in the 
form of weapons) are important in “raising the stakes” of potentially violent 
situations, which may change the meanings attributed to different behaviors 
(Fagan & Wilkinson 1998). Another factor that may influence the meanings 
attributed to the actions of others is the consumption of drugs or alcohol, due to 
the behavioral expectancies that may be associated with them. These poten- 
tially violence-producing combinations in meaning-assignment may be par- 
ticularly significant when considered in the context of the cognitive limitations 
of the developmental stage of adolescence (Leigh 1987). Dating violence may 
be a particularly relevant phenomenon to examine within this framework, 
given the highly charged adolescent expectancies surrounding alcohol con- 
sumption and sexuality (George et al 1988, Corcoran & Thomas 1991) as well 
as the heightened importance of sexuality to status attainment at this developmental
stage (Fagan & Wilkinson 1998).


Selective Disinhibition: Parker's Approach


Parker (1993) and Parker & Rebhun (1995) attempt to specifically link alcohol 
and violence in an overall conceptual model, utilizing rates of homicide as the 
indicator of violent behavior. Parker & Rebhun (1995) advance a sociological 
approach to the relationship between alcohol and violence that is much differ- 
ent from earlier, biologically based formulations of this relationship (see 
Room & Collins 1983 for a review of that literature and the widespread criti- 
cisms applied to this notion). In these earlier conceptualizations, alcohol was 
conceived as a biochemical agent that had a universal effect on social behav- 
ior, despite substantial evidence from cross-cultural studies that alcohol has a 
differential impact on behavior depending on the social and cultural contexts 
in which it is consumed (see Marshall 1979 for a number of examples of this 
point). 

Noting this limitation of previous formulations, Parker & Rebhun (1995) 
advance a social disinhibition approach, which tries to explain why normatively
proscribed behavior is “disinhibited” in relatively few cases. Alcohol selectively
disinhibits violence depending on contextual factors specific to the
situation, the actors involved and their relationships to one another, and the impact
of bystanders. In US society, norms about the appropriateness of violence
in solving interpersonal disputes argue both for and against such behavior 
(Parker 1993). The theory proposes that individuals are constrained from en- 
gaging in certain behaviors in a social situation by the norms that they have in- 
ternalized; however, people do violate norms and may have conflicting sets of 
norms to draw on in some situations. It is possible that norms that have the 
least institutional support are more likely to be disinhibited in a situation, all 
else being equal (Parker 1993, p. 118). 

To explain how choices are made between these conflicting normative 
structures, Parker & Rebhun (1995, pp. 34-35) introduce the tandem concepts
of active and passive constraint. In potentially violent situations, it takes active 
constraint (a proactive and conscious decision not to use violence to "solve"
the dispute) to preclude violence. In some of these cases, alcohol may disinhibit
norms that usually prevent or constrain individuals from engaging in violent
behavior. Thus, the selective nature of alcohol-related homicide is dependent
upon the interaction of an impaired rationality and the nature of the social
situation. The nature of the social situation, or the context in which behavior
takes place, is of paramount importance in determining the outcome of a
potentially violent situation. This is indicated by the fact that most alcohol- 
involved interpersonal disputes do not result in violence and homicide, but a 
few of these situations do (Parker & Rebhun 1995; see also Wilbanks 1984). 

Parker & Rebhun (1995) further refined and specified their theoretical 
model of the ways in which alcohol consumption and homicide rates might be 
related at the aggregate level by incorporating into the model control variables 
suggested by previous literature on the etiology of homicide, such as subcul- 
tural theories (e.g. Wolfgang & Ferracuti 1976), social bonds theory (e.g. 
Hirschi 1969, Krohn 1991), deterrence theory, routine activities (Cohen & Fel- 
son 1979), and, taking a cue from strain and social disorganization theories 
(e.g. Merton 1949, also Wilson 1987), controls for economic inequality and 
poverty rates. 

A test of this particular specification of the theory was reported by Parker 
(1995). Cross-sectional analysis of state-level data was undertaken for five dif- 
ferent types of homicide, differentiated by circumstances of crime and/or 
victim-offender relationship (e.g. robbery homicide, family homicide). Alco- 
hol consumption was a significant predictor of family intimate and primary 
nonintimate homicide, or those homicides involving the closest interpersonal 
relationships. These results suggest that norms prohibiting violence in resolv- 
ing interpersonal disputes in close or intimate relationships may be weaker 
than such norms prescribed in other interactions; alcohol consumption would 
appear to contribute to the "selective disinhibition" of an already weak norma- 
tive apparatus. Parker (1995, p. 27) also reported that the impact of poverty on 
robbery and other felony homicides was stronger in states with above average 
rates of alcohol consumption; the deterrent effect of capital punishment on
homicide rates was strongest in states that had below average rates of alcohol 
consumption, providing further support for the importance of the interplay be- 
tween alcohol consumption and contextual and social situational factors in the 
disinhibition of active constraint. 

Parker & Rebhun (1995) also report the results of two tests of this approach 
that utilize longitudinal research designs. The first, using city-level data, 
yielded evidence that increases in alcohol availability help to explain why 
homicide nearly tripled in these cities between 1960 and 1980. This study also 
found some evidence for mediating effects of poverty, routine activities, and a 
lack of social bonds on the relationship between homicide and alcohol avail- 
ability at the city level. 

In an examination of the general hypothesis that alcohol has a causal impact 
on homicide, Parker & Rebhun (1995, pp. 102-17) conducted a dynamic test of
the impact of increases in the minimum drinking age on youth homicide at the 
state level. Using data from 1976 through 1983, Parker & Rebhun (1995) esti- 
mated a pooled cross-section and time series model in which two general types 
of homicide, primary and nonprimary (based on the prior relationship between 
victim and offender), in three age categories (15-18, 19-20, and 21-24) were
analyzed. In the presence of a number of important predictors, the rate of beer 
consumption was found to be a significant predictor of homicide rates in five 
of the six age-homicide type combinations, and increases in the minimum 
drinking age had a negative and significant impact on primary homicides in all 
age categories. 
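
The general shape of such a pooled cross-section and time series model can be sketched as follows. This is only an illustrative form: the text above does not report the exact specification, so the state and year effects, the variable names, and the notation are all assumptions introduced here.

```latex
% Illustrative sketch only; not the specification reported by Parker & Rebhun (1995).
% s indexes states, t indexes years (1976-1983),
% a indexes the three age categories, k the two homicide types (primary, nonprimary).
\[
H^{(a,k)}_{st} = \alpha_s + \gamma_t
  + \beta_1\,\mathrm{Beer}_{st}
  + \beta_2\,\mathrm{MDA}_{st}
  + \mathbf{x}_{st}^{\top}\boldsymbol{\delta}
  + \varepsilon_{st}
\]
% H: homicide rate; Beer: rate of beer consumption; MDA: minimum drinking age;
% x: the other predictors; alpha_s and gamma_t: assumed state and year effects.
```

Read against this sketch, the findings summarized above correspond to a significant coefficient on beer consumption in five of the six age-by-type equations, and to a negative and significant coefficient on the minimum drinking age in the primary homicide equations for all three age categories.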


Violence Across Time and Space: The Cultural 
Consequences of Availability 


In another theoretical formulation that attempts to explain the links between 
alcohol availability and violence, Maria Luisa Alaniz, Robert Nash Parker, 
and others (1998, 1999) propose some mechanisms by which the spatial distri- 
bution of alcohol outlets and the targeted advertising of alcohol to particular 
communities—in both the spatial and demographic sense—may mediate this 
relationship. 

The work of Alaniz et al (1998, 1999) focuses on the relationship of youth 
violence to alcohol availability. Given the recent increases in youth violence, 
including the increasing proportional contribution to overall rates of lethal vio- 
lence (Blumstein 1995), this appears to be a very fruitful line of research to 
pursue, if one of the ultimate goals of such research is the prevention and re- 
duction of the incidence of violence. Additionally, these authors propose that 
due to the differences in cultural and legal status for alcohol and drugs (even 
taking into account the illegality of alcohol to minors), the relationship between
illicit drugs and violence is more likely to stem from properties of the illicit
distribution system (see Goldstein 1985), while the relationship between
alcohol and violence would be expected to be more related to ingestion of the 
substance, whether due to the effects of pharmacology, cultural expectancies 
surrounding alcohol’s use, or both (Parker 1995, Alaniz et al 1998). 

The authors propose two pathways by which alcohol availability may be re- 
lated to youth violence. The first of these is largely grounded in Parker & Reb- 
hun's (1995) selective disinhibition approach, in specifying the ways in which 
norms proscribing violence may be overcome (disinhibited) given the particu- 
lar characteristics of a social situation, including the presence of alcohol. The 
second considers the distribution of alcohol outlets in physical space and the 
ways in which this distribution may produce "great attractors" (Alaniz et al 
1998, p. 14), areas where social controls of all kinds are diminished, if not completely
absent; such areas have also been conceptualized as "hot spots" (Sherman
et al 1989, Roncek & Maier 1991) and “deviance service centers” (Clairmont
& Magill 1974). Alaniz et al theorize that in this kind of "anything-goes"
atmosphere (1998, p. 15), active constraint may be more likely to become dis- 
abled. Add to this the kinds of circumstances in which youths usually drink; 
due to the illegal status of alcohol for minors, youths must usually consume al- 
cohol in “semi-private” spaces, such as cars or deserted public parks, “thus 
[further] limiting the effectiveness of most external forms of social control” 
(Alaniz et al 1998, p. 13).

Alaniz et al (1999) also highlight the role of advertising in helping to articulate
the link between outlet density and youth violence that is particularly relevant
in minority communities, which bear a disproportionate share of all types
of violence, including youth violence. This aspect of the theory is further ex- 
plicated by Alaniz & Wilkes (1995), who undertook a semiotic analysis of al- 
cohol advertising targeted at Latino communities. The authors argue that such 
attempts to target minority communities are very effective because, for minor- 
ity groups in the United States, 


...the state exhibits indifference or hostility to claims of citizenship; the market
openly embraces the same people...components of Latino cultural armature
are appropriated by advertisers, reinvented, and returned...[;] this form
of reinvention constructs a symbolic system that builds alcohol consumption
into an idealized lifeworld of its constituents (Alaniz & Wilkes 1995, p. 433)


While this process of transforming cultural symbols into the commodity 
form is relevant for all sorts of products and services, it is especially relevant in 
the case of alcohol, given the highly charged nature of cultural expectancies 
surrounding its use (Brown et al 1987). In support of this thesis, Alaniz et al 
(1999) found that the density of alcohol advertising using sexist and demeaning
images of minority women was associated, at the neighborhood level, with
rates of sexual violence against females aged 12-18. 

The importance of context and the cultural effects of advertising on youths 
is demonstrated particularly well by the findings of researchers who initially 
set out to study links between illicit drugs and delinquency among Latino 
youth populations; these researchers found that tobacco use was significantly 
related to violent delinquency, while the use of alcohol and illicit drugs was not 
found to be so related. The authors explain this finding thus: 


Youngsters who use tobacco act out tobacco-associated identities available 
in the media and popular culture. They express a range of symbols about 
themselves that suggest being independent, adult, adventuresome, and 
tough. These values are also associated with drug use and violent delin- 
quency. (Watts & Wright 1990, p. 152) 


Goldstein's Tripartite Framework 


In 1985, Paul J Goldstein made an explicit attempt to develop a theoretical 
framework to describe and explain the relationship between drugs and violence.
Goldstein developed a typology of three ways in which drug use and
drug trafficking may be causally related to violence. 
“Psychopharmacological violence" is violence that stems from properties 
of the drug itself. In Goldstein's framework, this can be violence associated 
with drug ingestion by the victim, the perpetrator, or both. “Economic compul- 
sive violence” is violence associated with the high costs of illicit drug use. This 
type of violence does not stem directly from the physiological effects of drugs 
but is motivated by the need or desire to obtain drugs. The drugs one would
expect to be most often associated with economic compulsive violence are
opiates (particularly heroin) and cocaine, because of their capacity to produce
strong physical and psychological dependencies in users. “Systemic violence”
is defined by Goldstein as that type of violence associated with “traditionally
aggressive patterns
of interaction within the system of drug distribution and use” (Goldstein 1985, 
p. 497). Goldstein maintains that the risks of violence are greater to those in- 
volved in distribution than to those who are only users (Goldstein et al 1989). 
In the years since Goldstein’s original formulation, a fairly large number of 
empirical studies have been undertaken using this framework. Nearly all of 
them have been produced by researchers associated with Narcotic and Drug 
Research, Inc. as part of one of two major research initiatives; these are the 


3It should be pointed out that the majority of the variance in violent delinquency is explained by
prior incarceration; however, tobacco use also emerges as a significant, albeit weaker, predictor of
violent delinquency, thus highlighting the importance of social context in the links between
substance use and violent behavior.
Drug Relationships in Murder (DREIM) and the Drug Related Involvement in
Violent Episodes/Female Drug Related Involvement in Violent Episodes 
(DRIVE/FEMDRIVE) projects. 

The DREIM project involved extensive interviews with 268 homicide of- 
fenders incarcerated in New York State correctional facilities. One of the pur- 
poses of this project was to gain a more extensive understanding than that af- 
forded by official police records of the role that drugs and alcohol play in 
homicide. 

The DREIM project data indicated that the substance most likely to be used 
by homicide offenders on a regular basis as well as during the 24 hours directly 
preceding the crime was, overwhelmingly, alcohol. Marijuana and cocaine 
were the second and third most frequently implicated drugs in the lives of 
homicide offenders as well as in the offense itself (Spunt et al 1994, 1995). 

Other empirical investigations that rely on the Goldstein framework have
attempted to classify the relationship between drugs and all types of violence, 
under the auspices of the DRIVE/FEMDRIVE research initiative. The data 
collection for this project consisted of interviews with 152 male and 133 fe- 
male subjects concerning both drug and alcohol use and also their participation 
in violent events, over an eight week period. In one analysis, Spunt et al (1990) 
reported that violent events are drug-related if any of the participants report 
drug use proximate to the incident; similarly, if there is no link to drug distribution
or robbery, these "drug-related events" are classified as psychopharmacological.
These researchers fail to identify any mechanism by which these psychopharmacological
effects of drugs manifest themselves in violent behavior.
For example, they conclude that "heroin and methadone were the [illicit] drugs 
most likely to be associated with psychopharmacological violence" (Spunt et 
al 1990, p. 299), despite the fact that virtually no evidence exists to support 
individual-level associations between opiate use and violence (Kaplan 1983, 
Ball et al 1982, Ball 1991). 

Goldstein et al (1989) reported the results of research that was concerned 
primarily with the effect of the "crack epidemic" on homicide. Utilizing data 
from official police reports of homicides supplemented by an observational in- 
strument designed by Goldstein and his research team, the authors concluded 
that slightly over half of the 414 New York City homicides sampled were 
drug-related. Evidence from official records indicated that 65% of these drug-related
homicides involved crack cocaine as the primary substance, while another
22% were related to other forms of cocaine; combined, nearly 90% of
drug-related homicides in the sample involved cocaine. Of these, the overwhelming
majority (74.3% of all drug-related homicides) were classified as
"systemic" by the researchers. Interestingly, all homicides in which alcohol
was the primary substance involved were classified as psychopharmacologi- 
cal. 

Another example of the use of the Goldstein typology is the analysis of nine
female homicide offenders, reported by Brownstein et al (1994). This analysis 
provides further evidence that alcohol is the substance most commonly associ- 
ated with homicide. The authors also conclude from these data that the use of 
alcohol or drugs by either perpetrator or victim proximate to the homicide 
makes the homicide primarily drug- or alcohol-related (Brownstein et al 1994, 
p. 110) despite the fact that the authors report, in some cases, long histories of 
spousal abuse on the part of the homicide victim, which another researcher 
might consider at least as important a causal factor as the fact of drug or alco- 
hol consumption in leading up to the homicide. 

A central problem that characterizes all the work that utilizes the Goldstein 
tripartite framework is that it is not treated as a set of testable propositions but
rather as a set of assumptions about the nature of drug- and alcohol-related vio- 
lence. Because of this, studies guided by this set of assumptions do not address 
the task of explaining mechanisms by which violent events might be related to 
the presence or use of drugs or alcohol; additionally, all of these studies fail to 
provide a detailed explanation of the way in which study events come to be 
classified into one type or another. Another problem with Goldstein’s classifi- 
catory scheme is that the categories are not mutually exclusive. For example, 
many of the situations coded by researchers as events of systemic violence are 
economic in nature. Robbery of a drug dealer would seem to be an economi- 
cally motivated crime but is classified as systemic in this framework, based on 
drug trafficking involvement of the victim and/or perpetrator. In short, the 
Goldstein framework seems biased toward support of the systemic model of 
drug-effected violence, which also limits the utility of the framework for ex- 
plaining the relationship between alcohol—the substance most frequently im- 
plicated in violent events of all kinds—and violence. Additionally, the rigidity 
and inherently descriptive nature of the classification scheme fail to take into
account the possibility of interactions between social context, individuals, and 
pharmacology. 


CONCLUSIONS 


Several clear conclusions can be drawn from this extensive review of the literature
concerning drugs, alcohol, and violence. One is the overwhelming importance
of context in any relationship that may exist between substance use
and violent behavior. Our review of the literature finds a great deal of evidence 
that the social environment is a much more powerful contributor to the out- 
come of violent behavior than are pharmacological factors associated with any 
of the substances reviewed here. 

The other consistent finding that we can report from this review of the em- 
pirical evidence is that when violent behavior is associated with a substance, 
that substance is, overwhelmingly, alcohol. Study after study indicates that,
even in samples containing relatively high baseline rates of illicit drug use, 
violent events are overwhelmingly more likely to be associated with the consumption
of alcohol than with any other substance. In fact, a review of the literature
concerning rates of co-occurrence of violent crimes with the use of illicit
substances fails to provide any support whatsoever for a link. The 1991
Criminal Victimization Survey indicates that less than 5% of violent assailants 
were perceived by their victims to be under the influence of illicit drugs; the 
corresponding figure for alcohol is more than four times that. 

The consensus among the authors of previous reviews of research on alco- 
hol, drugs, and violence (Roizen 1993, Collins 1981, Pernanen 1991) was that 
evidence existed for an association especially between alcohol and violence, 
but that the research base would not support any stronger conclusions. These 
and other reviews would invariably end with a call for more and better research 
to address the issue of whether evidence about a causal relationship between
alcohol, drugs, and violence could be found. What was missing from those re- 
views, however, was a full recognition of the importance of theoretical devel- 
opment in the search for evidence about causality. Until the last ten years, such 
efforts were largely absent; a number of the studies cited here would replicate 
associational findings and end with this same lament about the absence of 
causal evidence. However, recent developments, especially the work of Gold- 
stein and colleagues, Fagan, and Parker and colleagues, have led to an in- 
creased conceptual and theoretical base from which questions of causality can 
be better assessed. None of these approaches has succeeded in fully theorizing 
the potential relationships among alcohol, drugs, and violence, and none of 
these perspectives has provided definitive empirical tests of these theoretical 
models. Indeed, all of these approaches need more theoretical development as 
well as better data and methodological approaches to advance the state of 
knowledge about these relationships. However, at least it is reasonable to 
claim that research on alcohol, drugs, and violence demonstrates some promis- 
ing theoretical approaches and some useful empirical studies based on those 
approaches. Much work is yet to be done, but the prospects for greater under- 
standing of how and why alcohol and drugs contribute to violence have never 
been brighter. 


ABSTRACT


Although it is evident in routine decision-making and a crucial vehicle of ra- 
tionalization, commensuration as a general social process has been given lit- 
tle consideration by sociologists. This article defines commensuration as the 
comparison of different entities according to a common metric, notes com- 
mensuration's long history as an instrument of social thought, analyzes com- 
mensuration as a mode of power, and discusses the cognitive and political 
stakes inherent in calling something incommensurable. We provide a frame- 
work for future empirical study of commensuration and demonstrate how 
this analytic focus can inform established fields of sociological inquiry. 


INTRODUCTION 


Consider three examples. Faculty at a well-regarded liberal arts college re- 
cently received unexpected, generous raises. Some, concerned over the dispar- 
ity between their comfortable salaries and those of the college's arguably un- 
derpaid staff, offered to share their raises with staff members. Their offers 
were rejected by administrators, who explained that their raises were "not
about them." Faculty salaries are one criterion magazines use to rank colleges.
Administrators, mindful of how fateful these rankings are, wished to protect 
their favorable ranking with preemptive faculty raises. Partly because college
raters pay closest attention to professors’ incomes, faculty and staff compensa- 
tion plans are not considered comparable. 

Several working mothers recently described their strategy for managing 
their anxiety about the amount of time they spend away from their young chil- 
dren. Each week, they calculate a ratio of mom-to-caregiver hours. If the ratio 
is close, or favors mom, they feel better. One woman admitted to "fudging" her
numbers to produce a guilt-ameliorating figure. An opposite appeasement
strategy involves the invention of "quality time," when harried parents try to
convince themselves that what matters is the richness, rather than the volume, 
of time spent with their children. The emergence of "quality time" as a way to 
mark the specialness of parental involvement corresponded to the large influx 
of mothers moving into the paid work force. But some mothers who embrace 
traditional roles, or who sacrifice careers and income to stay home with their 
children, sniff at the self-serving aroma of "quality time" (Hays 1996, Berger 
1995:43-44). 

An economist evaluating a proposed dam faced the problem of how to esti- 
mate the value of tubing down the river, an activity that the proposed dam 
would eliminate. Committed to including in his analysis the "cost" of losing 
this recreation enjoyed by thousands each warm weekend, he tried to synthe- 
size a demand curve for tubing. Despite valiant efforts and sizable expendi- 
tures, his efforts to derive a robust price for tubing failed. As is common with 
characteristics that are hard to measure, the value of tubing was excluded from 
the analysis of the dam (Espeland 1998). 

Commensuration, the transformation of different qualities into a common
metric, is central in each of these examples. Whether it takes the form of
rankings, ratios, or elusive prices, whether it is used to inform consumers and 
judge competitors, assuage a guilty conscience, or represent disparate forms of 
value, commensuration is crucial to how we categorize and make sense of the 
world. 

The consequences of commensuration are complex and varied. Commensu- 
ration can render some aspects of life invisible or irrelevant, as the failure to 
price river tubing illustrates. The expansion of commensuration can be a politi- 
cal response to exclusion or inequality. This tactic is embraced by some law- 
makers, environmentalists, and bureaucrats (including the economist just de- 
scribed) who wish to expand what is considered relevant in bureaucratic deci- 
sions (Taylor 1984, Espeland 1998), by women advocating comparable worth as 
a means for redressing pay inequity (Nelson & Bridges forthcoming, England 
1992), or by economists grappling with problems of externalities (Baumol & 
Oates 1979). For the working mothers, commensuration can be a deeply per- 
sonal way to negotiate difficult contradictions. But rejecting commensuration 
as an appropriate expression of value can also be a political response for those, 
like some homemakers, who see their identities jeopardized by the commodification
of their work and the quantification of their investments.

We argue that commensuration is no mere technical process but a funda- 
mental feature of social life. Commensuration as a practical task requires enor- 
mous organization and discipline that has become largely invisible to us. Com- 
mensuration is often so taken for granted that we forget the work it requires 
and the assumptions that surround its use. It seems natural that things have 
prices, that temporality is standardized, and that social phenomena can be 
measured. Our theories presume that we commensurate when choosing and 
that values can be expressed quantitatively. Commensuration changes the 
terms of what can be talked about, how we value, and how we treat what we 
value. It is symbolic, inherently interpretive, deeply political, and too important
to be left implicit in sociological work.

Commensuration warrants more sustained and systematic treatment. 
(Scholars working toward this project include Porter 1995; Radin 1996; Anderson
1993; Espeland 1992, 1998; Hurley 1989; Sunstein 1994; Desrosières
1990; and Zelizer 1994.) We need to explain variation in what motivates peo- 
ple to commensurate, the forms they use to do so, commensuration’s practical 
and political effects, and how people resist commensuration. This is possible 
only when commensuration is investigated as a field. Our neglect of commen- 
suration as a general phenomenon and our failure to provide a framework for 
its investigation as such have kept us from appreciating its social and theoretical
significance. Our goal is to begin building such a framework. We start by
defining commensuration, describing its long intellectual history, and explain- 
ing its significance. We then discuss the cognitive and political stakes inherent 
in calling something incommensurable, we offer guidelines for future empiri- 
cal studies, and we illustrate how this focus can illuminate current sociological 
research. 


WHAT IS COMMENSURATION? 


Commensuration is the expression or measurement of characteristics normally 
represented by different units according to a common metric. Utility, price, 
and cost-benefit ratios are common examples of commensuration, although 
the logic of commensuration is implicit in a very wide range of valuing systems:
college rankings that numerically compare organizations; censuses and
social statistics that make cities and nations numerically comparable; actuarial 
projects that attempt to quantify and compare vastly different kinds of risks; 
commodity futures that make uniform units out of products that may not yet 
exist; voting, and the pork-barrel trading of diverse interests that often lies be- 
hind it; calculation of different kinds of work in terms of labor costs; and the ad 
hoc calculations of trade-offs among such potentially incomparable values as 
career and family, breadth and depth in scholarship, and freedom and commitment
in love.

Commensuration transforms qualities into quantities, difference into mag- 
nitude. It is a way to reduce and simplify disparate information into numbers 
that can easily be compared. This transformation allows people to quickly 
grasp, represent, and compare differences. One virtue of commensuration is 
that it offers standardized ways of constructing proxies for uncertain and elu- 
sive qualities. Another virtue is that it condenses and reduces the amount of in- 
formation people have to process, which is useful for representing value and 
simplifying decision-making. The complexity of decisions has propelled the 
spread of commensuration in decision-making (Stokey & Zeckhauser 1978); 
so too has our growing appreciation of people’s cognitive limitations (Tversky 
& Kahneman 1974, 1981; Thaler 1983; for a good review see Heimer 1988). 
Commensuration makes possible more mechanized decision-making. Com- 
puter programs that calculate utility functions, elicit and measure values, and 
identify alternatives that maximize people’s utility can assure the consistency 
that people lack; in some cases, they mechanically tell people what to do. The 
technical advantages of commensuration can be enormous, but sometimes its 
symbolic and political advantages are paramount (Feldman & March 1981). 

Commensuration sometimes responds to murky motives. It may be 
prompted by a desire to look rational, limit discretion, or conform to powerful 
expectations. Commensuration may be spurred by a desire to expand democra- 
tization (Cohen 1982, Espeland 1998), or by a wish to hide behind numbers, 
impose order, or shore up weak authority (Porter 1995). Commensuration can 
provide a robust defense for controversial decisions, expand a group's organ- 
izational or professional turf, or even be a means to appease God (Carruthers & 
Espeland 1991). 

Our desire to manage uncertainty, impose control, or secure legitimacy pro- 
pels us to create a dazzling array of strategies to use when we standardize. The 
scripts delivered by salespeople, the forms we use when we enroll our children 
in kindergarten or visit the doctor, and the practiced smiles of flight attendants 
are all forms of standardization. What distinguishes commensuration from 
other forms of standardization is the common metric it provides. When com- 
mensuration is used in decision-making, the procedure for deriving this metric 
amounts to a series of aggregations. 

Most quantification can be understood as commensuration because quanti- 
fication creates relations between different entities through a common metric. 
Commensuration is noticed most when it creates relations among things that 
seem fundamentally different; quantification seems distinct from commensuration
when the objects linked by numbers already seem alike. When we assume
the unity conferred by numbers, when the homogeneity among things
appears to be a property of the object rather than something produced by quantification,
then we imagine we are simply counting or measuring something
rather than commensurating disparate entities. For example, the census ap- 
pears to be a method for counting people rather than a mechanism for con- 
structing and evaluating relations among citizens of a state or region. This is 
because implicit in the act of counting is a conception of citizenship or identity 
that renders unproblematic the coherence of the relations among diverse people.
As Theodore Porter (1986:24) put it, "It makes no sense to count people if
their common personhood is not seen as somehow more significant than their
differences."1

Commensuration is fundamentally relative. It creates relations between attributes
or dimensions where value is revealed in the comparison. When used
to make decisions, commensurated value is derived from the trade-offs made 
among the different aspects of a choice. Value emerges from comparisons that 
are framed in terms of how much of one thing is needed to compensate for 
something else. In complex choices, commensuration often occurs at several 
levels of analysis. For example, before building a dam, analysts want to know 
how the dam would affect the quality of water. Water quality has many dimen- 
sions (e.g. temperature, the amount and nature of dissolved solids, turbidity, 
pH), and even though these dimensions are already quantified, they are meas- 
ured with different scales. Aggregating these attributes according to some 
broader metric creates “water quality.” 
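
The aggregation step described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the dimension names, the "best" and "worst" readings used for rescaling, the weights, and the sample readings are hypothetical and are not drawn from any actual dam analysis. The sketch shows only the general logic: each dimension is first rescaled onto a shared unitless scale, and the rescaled scores are then combined through weights that encode the trade-offs among dimensions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Dimension:
    """One water-quality attribute measured on its own scale (all values hypothetical)."""
    name: str
    best: float    # reading that earns a score of 1.0
    worst: float   # reading that earns a score of 0.0
    weight: float  # relative importance assigned by the analyst


def score(dim: Dimension, reading: float) -> float:
    """Rescale a raw reading onto a unitless 0-1 scale, clamped at both ends."""
    fraction = (reading - dim.worst) / (dim.best - dim.worst)
    return max(0.0, min(1.0, fraction))


def water_quality_index(dims: list[Dimension], readings: dict[str, float]) -> float:
    """Weighted average of the rescaled scores: the common metric."""
    total_weight = sum(d.weight for d in dims)
    weighted = sum(d.weight * score(d, readings[d.name]) for d in dims)
    return weighted / total_weight


# Hypothetical dimensions; pH is expressed as distance from neutral (7.0).
DIMENSIONS = [
    Dimension("temperature_c", best=10.0, worst=30.0, weight=1.0),
    Dimension("dissolved_solids_mg_l", best=0.0, worst=1000.0, weight=1.0),
    Dimension("turbidity_ntu", best=0.0, worst=50.0, weight=1.0),
    Dimension("ph_deviation", best=0.0, worst=3.0, weight=1.0),
]

# Hypothetical readings for one stretch of river.
READINGS = {
    "temperature_c": 20.0,
    "dissolved_solids_mg_l": 250.0,
    "turbidity_ntu": 10.0,
    "ph_deviation": 0.6,
}

index = water_quality_index(DIMENSIONS, READINGS)
print(f"water quality index: {index:.4f}")
```

The weights are where the trade-offs are made explicit: with equal weights, a given loss on one rescaled dimension is exactly compensated by an equal gain on another, and a different choice of weights or endpoints would yield a different "water quality" from the same readings.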

The structure of value rooted in trade-offs is like that of an analogy: Its 
unity is based on the common relationship that two things have with a third 
thing, a metric. How difficult or controversial commensuration is depends 
partly on whether it is used routinely to express the value of something, on 
whether people accept it as a legitimate expression of value, and on how 
disparate-seeming are the entities being commensurated. For example, commodification
has become so naturalized that it is hard to construe the value of
some goods in forms apart from price. 

Commensuration can be understood as a system for discarding information 
and organizing what remains into new forms. In abstracting and reducing information,
the link between what is represented and the empirical world is obscured
and uncertainty is absorbed (March & Simon 1958:138-39, 150-51).
Everyday experience, practical reasoning, and empathetic identification be- 
come increasingly irrelevant bases for judgment as context is stripped away 
and relationships become more abstractly represented by numbers. 


1 Counting and measuring may be controversial if the likeness or comparability of the units
being counted is disputed. For example, during the Constitutional Convention of 1787, Southerners
who rejected slaves’ citizenship rights nevertheless wished to expand their political clout in the
House of Representatives. For the purposes of apportioning representatives, they agreed that slaves
should count as “three-fifths of all other persons.”

As we demonstrate below, the forms commensuration takes vary on several
dimensions. First, modes of commensuration vary in how technologically 
elaborated they are. Some are highly elaborated, as in the cost-benefit analyses 
first developed by government bureaucrats and then elaborated by economists 
and decision theorists to adjudicate between diverse and often costly social 
policies (Porter 1992, 1995:148-89). Other modes are only marginally elaborated,
such as the often ad hoc calculations made by spouses to determine the
relative equitability of household chores (e.g. Hochschild 1989). Second, 
modes of commensuration vary in how visible or explicit they seem. There is 
some correlation between elaborateness of a mode of commensuration and ex- 
plicitness of the project. For an economist trying to synthesize a demand curve 
for river tubing, the labor involved in commensuration is both deliberate and
apparent; for spouses trying to equalize household contributions, the process 
may seem as natural as it does commonsensical. But some highly
elaborated modes of commensuration, such as generating identical units of
value in stocks or commodities futures (Cronon 1991:97-147; Porter 1995:45-48),
are complex technical feats that seem "natural" to traders and stockholders
nevertheless. This suggests a third dimension of variation in modes of
commensuration, institutionalization, which we address in further detail below.
Finally, modes of commensuration vary according to who their agents
are. Some modes are the jealously guarded turf of distinct professional bodies; 
actuary work in insurance is a prime example (Porter 1995:101-13). Other
modes are made routine and then embedded in complex divisions of labor, as 
in the lower-level diagnostic and charting work done by nurses and physician
residents, who standardize patients in part by transforming vital signs into dis- 
crete numerical measures (Bosk 1979, Chambliss 1996). Still other modes are 
common features of everyday social experience, as in consumers’ efforts to lo- 
cate bargains at the grocery store or make trade-offs among purchases. 


HISTORICAL LEGACIES 


The linking of rationality to commensurability, and irrationality to incommensurability,
are old ideas that appealed to some of our deepest thinkers. As Martha
Nussbaum wrote (1984:56-57; 1986), the pairing of numbering, measurement,
and commensuration with order, the pairing of comprehension with control,
and obversely, the pairing of incommensurability with chaos, anxiety, and
threat are characteristic of Greek writing in the fifth and early fourth centuries 
BCE. Nussbaum argues that commensuration was crucial to Plato's under- 
standing of the Good, since Plato believed that we need to make our ethical 
values commensurate in order to prioritize them. Complex ethical concerns, if 
left incommensurate, would create conflict, confusion, and pain. 

As others would argue much later, Plato believed that commensuration as a
mode of perceiving the world would also change those who used it. Commen- 
suration would make us more rational and render human values more stable 
and less vulnerable to passion, luck, and fate. One of the great virtues of com- 
mensurability for Socrates and for Plato was that it could help us eliminate ak- 
rasia by structuring our choices in ways that make it obvious what we should 
do; commensuration would make our ethical or practical problems easy to 
solve in the same way that it is easy to choose between $50 and $200 (Nuss- 
baum 1986:114). 

But for Plato, an equally important feature of commensuration is that this 
willful elimination of the heterogeneity of values also stabilizes our emotions 
and attachments by removing motives for irrational behavior, motives such as 
commitment to passionate, singular love. If in using a general concept of value 
we can frame our choices as between more or less of the same quantity, we no 
longer feel the same way toward those things. If we understand our lover not as
a uniquely compelling person but rather as one who provides us with some specific
amount of general pleasure, we value our lover not only differently, but
less. The more interchangeable our lover is with someone nearly as beautiful 
or more clever, the less vulnerable we are, and the less likely we are to pursue 
our lover with reckless abandon. 

Plato’s claim is powerful. He understood that in making us more stable and
less passionate, commensuration was both appealing and frightening. For Ar- 
istotle, Nussbaum argues, eliminating our vulnerability, and therefore our pas- 
sion, was a prospect too disturbing. He believed our sense of beauty depends 
precisely on its ephemeral qualities, that our ethics require us to invest in the 
singularity of others. Investing in what is unique is risky, but the loss of vulner- 
ability is even more threatening, for goodness requires ethical risks, valuing 
things for their own sake, passion. For Aristotle, the fragility of goodness is 
undermined by understanding value as general and homogeneous (Nussbaum 
1986, pp. 235-354).

Commensuration can change our relations to what we value and alter how 
we invest in things and people. Commensuration makes the world more predictable,
but at what cost? For Aristotle, a price too high; for Plato, an essential
sacrifice. The homogeneity commensuration produces simultaneously dimin- 
ishes risk and threatens the intensity and integrity of what we value. These two 
important themes found in Plato's and Aristotle's view of ethics reemerge in 
our most compelling critiques of modernity: in Karl Marx's critique of capital- 
ism, where commodification distorts human relations by turning people into 
means and things into ends; in Max Weber's analysis of the constraints of the 
iron cage and the disenchantment attending rationalization; and in Georg Sim- 
mel's analysis of money, where the objectification of value inserts distance be- 
tween us and what is valued, fostering intellectualization and detachment. 

Commensuration is crucial for capitalism and so is a prominent theme in
Karl Marx's work. For Marx, commensuration is key for understanding the 
central social categories of capitalism: labor, value, commodity, and money. 
Marx argued that under capitalism, labor is the great commensurator. Value is 
derived from labor, and the commensuration of value is also accomplished 
through labor. Value exists in precise quantities in all commodities but is not
measured according to the particular products of various kinds of labor. 
Rather, value is expressed in terms of what these have in common: the general 
experience of labor, what Marx (1976 [1867]:992) calls abstract labor, which 
is measured as labor-time.2

Labor has dual qualities. Concrete labor is the distinctive labor process 
shaped by the particular things it produces, things that have specific uses for 
people, such as food to eat or clothes to wear. Concrete labor produces use- 
value, but it does not produce value in the general sense. Abstract labor, on the 
other hand, is the "socially necessary general labor" that produces undifferentiated
value. It is the peculiar way we obtain goods under capitalism, where
what we produce has no intrinsic relation to the products we ultimately acquire 
through our labor (Postone 1996, p. 149). Value, as an expression of abstract 
labor, commensurates because it abstracts away the distinctiveness of the par- 
ticular forms of work, objects of work, and practical uses of these objects. The 
concrete labor that produces use-value and the abstract labor that constitutes 
value are not two separate kinds of labor but rather two aspects of labor under 
capitalism (Postone 1996:144). 

The theoretical commensuration implicit in Marx's conception of abstract 
value is in turn crucial for his conception of commodity—the basic social form 
of capitalism. Abstract labor is what is common to all commodities. A com- 
modity possesses both use-value and exchange value: It is both a product and a 
social relation that embodies exchange value. For use-value, labor matters 
qualitatively; for exchange-value, it matters quantitatively (Marx 1976 
[1867]:136). The tension between qualitative expressiveness and use of labor, 
and its quantitative expressiveness (between what is incommensurate and 
what is commensurate), is part of the dual and contradictory nature of com- 
modities. One way that commodification debases human life is that qualitative 
differences become quantitative differences: People become means, things be- 
come ends. 

Money, for Marx, is not what makes commodities commensurate. This is
an illusion (Marx 1976 [1867]:188); instead, money is an expression of the
commensuration already embodied in abstract labor. But money is powerful,
partly for the illusions that it helps sustain. As a means of circulating commodities,
money obscures the social relations behind them. Because it allows
us to buy anything, money becomes the universal object of possession. The ultimate
"pimp," money mediates between our needs and the object of our needs,
between life and the means of life, between my life and others (Marx 1976
[1867]:102). Money appears to us more real than the relations behind it, an end
rather than a means to an end. Under capitalism, qualities are quantified, and
all qualitative needs that cannot be expressed quantitatively, or bought, are inhibited
(Heller 1976:55).


2 This discussion of abstract labor is indebted to Postone (1996, pp. 123-85).

Commensuration and its limits are central themes in Max Weber's investi- 
gations of rationalization, which often parallel the dialogue between Plato and 
Aristotle on the virtues and threat of reason based on calculation. Weber's am- 
bivalence did not allow him to choose sides. For Weber, the expanding role of 
calculation as a strategy to manage uncertainty was a central feature of West- 
ern rationalism and crucial for the development of capitalism. The growing 
importance of knowledge and technical expertise in everyday life, the increas- 
ing depersonalization of structures of power and authority, and our expanding 
control over material objects, social relations, and self are unifying character- 
istics of Western rationalism (Brubaker 1984:29–35). Calculation and stan- 
dardization were crucial in each of these processes. 

To take one example, Weber (1981:276) argued that rational capital ac- 
counting, a sophisticated form of commensuration, was essential to the devel- 
opment of modern capitalism. Accounting allows capitalists to rationally 
evaluate the outcomes of past investments, to calculate exactly the resources 
available to them and project future income, and to assess and compare future 
investments. Accounting reconceptualizes and depersonalizes business rela- 
tions and fosters an objective stance toward business. But as Weber shows us, 
efforts to rationalize can be hard-fought battles. Those who benefit from an ex- 
isting system of authority often resist mightily the intrusion of commensura- 
tion that threatens their privilege (Swetz 1987:181–82; Weber 1981:224). 

The efficiency of bureaucracies and economic transactions depends on 
their growing depersonalization and objectification. The impersonality of eco- 
nomic and bureaucratic rationality is vastly enhanced by commensuration, be- 
cause it standardizes relations between disparate things and reduces the rele- 
vance of context. This impersonality is hostile to ethical systems that depend 
on personal ties, which explains why religious elites often have aligned with 
aristocrats to protect patriarchal relations. 

One way to think about the tension between ethical systems and formal ra- 
tionality is to conceptualize it as a contest over the limits of commensuration. 
Ethical and political systems based on personal relations often emphasize the 
uniqueness of individuals or the distinctive relations between certain catego- 
ries of individuals. But rational systems depend on numerous forms of com- 
mensuration, on bureaucracies that strictly separate offices from their incum- 
bents, and on the elaborate rules that define offices. One might interpret formal 
rationality as rendering offices unique and the people who hold the offices 
commensurable. In this way, the incommensurability of individuals that is ba- 
sic to much ethics confronts the radical commensuration of formal rationality. 
Conflicts generated by such confrontations are irreconcilable. 

Where Weber emphasized the technical superiority of rational forms, Sim- 
mel was attentive to their symbolic and constitutive power. He investigated 
how our collaboration with social forms changes us. Simmel’s (1978) ex- 
tended treatment of money offers a brilliant analysis of commensuration. Sim- 
mel sees money as largely responsible for the increasing divergence between 
the objective and subjective culture that characterizes modern life. Money 
speeds up the pace of the production of cultural forms, making it harder for in- 
dividuals to assimilate them. Money advances the development of people’s in- 
tellectual faculties over their emotional faculties because of its vibrant instru- 
mentality, its character as the “perfect tool.” This quality extends the causal 
connections we make between things to such an extent that the end point, the 
ultimate value, becomes obscure. This is what accounts for the “calculating 
character of modern times,” where people become obsessed with “measuring, 
weighing, and calculating” (Simmel 1978:443–44). 

When a form becomes taken for granted as a means of understanding rela- 
tionships and values, things that are hard to assimilate to the form seem in- 
creasingly unreal. Money also contributes to the transformation of substantive 
values into money values; this homogenizes life, but it also offers autonomy, 
even freedom. Simmel concludes his analysis with a profound point: Over 
time, money increasingly approximates a pure symbol of the relativity of value 
and of the relativistic character of existence more generally—a character that 
money helped to define (Simmel 1978:512). 

Simmel's insights about money can be usefully extended to other forms of 
commensuration. Utility, for example, is an even more enveloping form of 
relativity because it embodies the relativity of all value, even of those things 
without prices. Utility can precisely convey any value and its relation to any 
other value, whether it is fresh air, children, or even death.3 

The compulsion to create forms stems from our need to make sense of the 
world, but as Simmel understood, forms may possess a force that seems to ad- 
here to them independent of their users. Forms create expectations as well as 
coherence, and a form's familiarity encourages our complicity. This complic- 
ity enhances the rhetorical appeal of forms and is one reason we find them 
compelling (see Burke 1969:58–59). Commensuration encourages us to be- 
lieve that we can integrate all our values, unify our compartmentalized worlds, 
and measure our longings. 

3 One important difference between money and other forms of commensuration is that some 
commensurated forms are even more abstract than money, having no tangible existence that makes 
their symbolic expression less distant. Utility cannot be inscribed with the faces of queens, so 
perhaps this makes it a less effective symbol. Because some forms of commensuration, including 
money, are so closely tied to our notions of rationality, these forms can symbolize rationality. 

Fundamental to classical critiques of modernity, commensuration is also 
central to many contemporary versions of rational choice theory. Rational 
choice theory varies in its assumptions, goals, and applications, but many ver- 
sions make commensuration a prerequisite for rationality. Steinbrunner 
(1974:25–46) characterizes rational choice theory as deriving from three key 
assumptions: First, that separate dimensions of values are integrated via trade- 
offs in a deliberate balancing of competing claims of values. Integration is ac- 
complished by creating some metric that gives the worth of one value in terms 
comparable to the other, and this commensuration of values must occur in ad- 
vance of the final analysis of outcomes. Utility conceptually integrates values; 
it is a measure of absolute value, an ideal measure that would subsume all di- 
mensions of value and provide a basis for making comparisons between 
choices. Second, alternative outcomes are evaluated and analyzed based on 
predictions about their consequences. Third, people adjust their expectations 
as more is known about how alternatives will perform, but these new expected 
outcomes are evaluated by the same metric. 
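
The three assumptions can be made concrete with a minimal sketch. Everything in it is a hypothetical illustration rather than anything taken from Steinbrunner: the value dimensions, the weights, the option names, and the 0-10 scores are invented, and a weighted sum is only one simple way of building a common metric.

```python
# Assumption 1: separate value dimensions are integrated by trade-off weights
# that are fixed before outcomes are analyzed. The weights are hypothetical.
WEIGHTS = {"cost_savings": 0.5, "safety": 0.3, "scenery": 0.2}


def utility(predicted):
    """Collapse scores on separate value dimensions into one number."""
    return sum(WEIGHTS[dim] * score for dim, score in predicted.items())


def best_option(predictions):
    """Pick the option whose predicted consequences score highest."""
    return max(predictions, key=lambda name: utility(predictions[name]))


# Assumption 2: options are evaluated by their predicted consequences
# (invented 0-10 scores).
predictions = {
    "build_dam": {"cost_savings": 8, "safety": 6, "scenery": 2},
    "no_dam": {"cost_savings": 3, "safety": 5, "scenery": 9},
}
first_choice = best_option(predictions)

# Assumption 3: expectations are revised as more is learned,
# but the revised outcomes are scored with the same metric.
revised = dict(predictions)
revised["build_dam"] = {"cost_savings": 4, "safety": 6, "scenery": 2}
second_choice = best_option(revised)
```

In this sketch the choice can change when predictions change, but the weights never do, which is the sense in which commensuration precedes the analysis of outcomes.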


WHY COMMENSURATION MATTERS 


Investigating commensuration is important because it is ubiquitous and de- 
mands vast resources, discipline, and organization. Commensuration can radi- 
cally transform the world by creating new social categories and backing them 
with the weight of powerful institutions. Commensuration is political: It re- 
constructs relations of authority, creates new political entities, and establishes 
new interpretive frameworks. Despite some advocates' claims, it is not a neu- 
tral or merely technical process. 

Commensuration is everywhere, and we are more likely to notice failures of 
commensuration than its widespread, varied success. Our faith in price as a 
measure of value is so naturalized that we now routinely simulate markets for 
elusive and intangible qualities. Although efforts to price tubing might have 
failed, there are well-established procedures for attaching prices to everything 
from corporate goodwill to surrogate pregnancies. 

Where markets do not exist they are often invented. Corporations routinely 
create internal markets for the goods and services produced by subunits, and 
these fictive prices matter enormously in people's jobs (Eccles 1985). Some 
business schools require students to bid for their courses. Economists advocate 
creating markets in pollution to help curtail both pollution and theoretically 
unsavory externalities (Baumol & Oates 1979). Insurers work to quantify such 
consequential uncertainties as the professional reputations of their clients 
(Heimer 1985). 

Economists have developed dazzling techniques for measuring utility, and 
its conceptual and practical influence is hard to overstate. There is hardly an is- 
sue in government that is not framed by the logic of cost-benefit analysis; its 
deployment in matters of health care and safety (Jasanoff 1989, Weisbrod 
1961), education and environment (Smith 1984), and program evaluation (Kee 
1994:456–88) is routine. Social science is often synonymous with measure- 
ment and model-building. Commensuration is fundamental to management, 
regardless of whether its object is art or widgets. Bureaucrats and analysts use 
sophisticated decision models requiring commensurated values when making 
decisions on everything from welfare to warfare. We devote enormous re- 
sources to commensuration. We have industries, agencies, and disciplines 
dedicated to measuring and managing risk (Heimer 1985, Jasanoff 1986, Clark 
1989), measuring public opinion, quantifying intelligence (Carson 1993), 
simulating prices (Portney 1994), assessing values, and making decisions—all 
of which depend on our capacity to commensurate anything. 

Commensuration is a radical social form, partly because of the assumptions 
that inform its use. Its long associations with rationality make it ideologically 
potent. Assuming that values can be made commensurate and that commensu- 
ration is a prerequisite to rationality are powerful ideas. Embedded in this logic 
is another assumption: that all value is relative and that the value of something 
can be expressed only in terms of its relation to something else. This form of 
valuing denies the possibility of intrinsic value, pricelessness, or any absolute 
category of value. Commensuration presupposes that widely disparate or even 
idiosyncratic values can be expressed in standardized ways and that these ex- 
pressions do not alter meanings relevant to decisions. 

Commensuration is radically inclusive. It offers an abstract form of unity 
that can potentially encompass any valued thing. Whether commensuration is 
accomplished in a price, utility curve, cost-benefit ratio, or multi-attribute 
trade-off scheme, any value or preference can be made commensurate with 
any other. The capacity to create relationships between virtually anything is 
extraordinary in that it simultaneously overcomes distance (by creating ties 
between things where none before had existed) and imposes distance (by ex- 
pressing value in such abstract, remote ways). In doing so, commensuration 
creates new things, new relations among disparate and remote things, and 
changes the meanings of old things (Goody 1986). 

According to Hacking (1990:181–95), from 1820 to 1840, unprecedented 
and nearly universal numerical enthusiasm produced an “avalanche of num- 
bers.” One result was the discovery of an astonishing number of regularities: in 
worker illness, suicide, crime, epidemics, and childbearing. Determinism was 
a casualty of the exponential growth in the production of numbers, as quests 
for “exactness” gave way to relentless efforts to understand and tame chance. 
Another consequence was the rapid proliferation of categories—categories in- 
vented to name and sort the newfound regularities, categories that then became 
constitutive. 

The category of “society,” Porter argues, is largely a statistical construct 
(1995:37, 1986:156–57). The regularities revealed in suicide and crime could 
not be attributed to individuals. A broader category was needed to account for 
them, and beginning around 1830, they were designated properties of society. 
Such regularities were powerful evidence of the autonomous existence of soci- 
ety, of “collective forces,” as Durkheim famously argued (1951:297–325; 
Hacking 1990:182). Society was soon understood as an aspect of life even 
more basic than state (Collini 1980:203). Interpreted as statistical laws that 
governed naturally, these regularities helped buttress laissez-faire liberalism. 
But the invention of crime rates in the 1830s and of unemployment rates some 
70 years later as societal characteristics helped define these as collective re- 
sponsibilities worthy of reform, rather than the just deserts of unworthy per- 
sons (Himmelfarb 1991:41). 

Even controversial or artificial-seeming products of commensuration, once 
backed by powerful institutions, become real, fateful, and autonomous. As 
Porter notes (1995:41–42), bureaucrats and activists have turned Americans of 
Puerto Rican, Mexican, Cuban, Iberian, and Central and South American de- 
scent into "Hispanics." Once such statistical categories become routinized in 
bureaucracies or written into law, they become increasingly real and fateful. 
Deployed by bureaucrats and politicians, distributed by media, and analyzed 
by social scientists, their use gives them meaning, consequence, and objectiv- 
ity. Official statistics become, in Latour & Woolgar's term (1986), "black 
boxes" that are hard to discredit or even to open. 

Economic integration requires commensuration. The capacity to commen- 
surate time, labor, product, monies, and securities has helped create a world 
where a powerful, if invisible, relationship exists between the unemployed fac- 
tory worker in the United States and the child laborer in Malaysia.4 Commen- 
suration makes possible precise comparisons across vast cultural and geo- 
graphical distances that allow transactions fundamental to global markets. The 
worldwide ascendancy of finance and service industries has propelled com- 
mensuration, one by-product of which is an increasing polarization of wealth. 


4 Commensuration was central to Taylor's (1947) efforts to control labor. Armed with 
stopwatches and calculations, scientific management would reduce work to its most elemental, 
standardized forms; Taylor wished to make management a scientific endeavor governed by rules 
and calculations and to transform relations between workers and managers by depersonalizing 
authority (Bendix 1956:274–81). The wages and perks of many Americans who talk on phones for 
a living are linked to performance evaluations performed by computers that track the volume and 
length of their calls and the seconds between them (Schwartz 1994:240–41). 
The development of an international property market means that real estate 
prices in Manhattan are linked to those in London or Paris and are shaped by 
flows of capital from Japan or Hong Kong (Sassen 1994:5–6, 99–117). Japa- 
nese investors' forays into New York real estate, for example, drove up prices 
and squeezed many small businesses out of the market. 

When built into large institutions, commensurative practices are powerful 
means for coordinating human action and making possible automated deci- 
sion-making. Sophisticated forms of commensuration have transformed our fi- 
nancial markets. Computer programs that continually search for discrepancies 
between stock prices, futures, and options prices have generated new invest- 
ment strategies and have mechanized a broad array of investment decision- 
making. Now, distinctions are made between “discretionary” traders, who rely 
on their own judgment and “system” traders, who rely on mechanically pro- 
duced signals to make decisions (Lucas & Schwartz 1989). 

Techniques for commensurating are not evenly distributed. These patterns 
may reflect longstanding interests in commensuration, where those with the 
most to gain from commensuration have become its most sophisticated practi- 
tioners. Not surprisingly, water development agencies had sophisticated meth- 
ods for calculating the benefits of dams long before they devised these for 
costs (Espeland 1993). Other biases exist. Units of analyses are often used that 
obscure the distributional effects of policies. Cost-benefit analyses that “dis- 
count the future” favor immediate benefits and distant costs over long-term 
benefits and immediate costs. This spurs development at the expense of envi- 
ronmental costs (Schnaiberg 1980:334–44). Even more fundamentally, pre- 
suppositions for commensuration often reflect assumptions about commodifi- 
cation that are inherently political and asymmetrical (Radin 1996, Sunstein 
1994). 
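
The bias introduced by discounting can be shown with a short worked sketch. It uses the standard present-value formula, amount / (1 + rate) ** years; the two projects, the dollar amounts, the 30-year horizon, and the 5 percent rate are hypothetical numbers chosen only for illustration.

```python
def present_value(amount, years, rate):
    """Value today of an amount received (or paid) `years` from now."""
    return amount / (1 + rate) ** years


def net_present_value(flows, rate):
    """Sum of discounted flows; each flow is (years_from_now, amount),
    with costs entered as negative amounts."""
    return sum(present_value(amount, years, rate) for years, amount in flows)


# Hypothetical project A: benefit of 100 now, cost of 150 in 30 years.
develop_now = [(0, 100.0), (30, -150.0)]
# Hypothetical project B: cost of 100 now, benefit of 150 in 30 years.
conserve = [(0, -100.0), (30, 150.0)]

for rate in (0.0, 0.05):
    print(rate,
          round(net_present_value(develop_now, rate), 2),
          round(net_present_value(conserve, rate), 2))
```

With no discounting, the project with the distant cost loses 50 and the project with the distant benefit gains 50. At a 5 percent rate the ranking reverses, because the amount 30 years away shrinks to roughly a quarter of its nominal size.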

And finally, efforts to translate incommensurable values into commensu- 
rated value not only can distort the character of people’s investments but can 
repudiate identities that are closely linked to incommensurable values. 


INCOMMENSURABLES 


Commensuration sometimes transgresses deeply significant moral and cul- 
tural boundaries. Defining something as incommensurate is a special form of 
valuing. Incommensurables preclude trade-offs. An incommensurable cate- 
gory encompasses things that are defined as socially unique in a specific way: 
They are not to be expressed in terms of some other category of value. Follow- 
ing Raz (1986:326–29), we broadly define something as incommensurable 
when we deny that the value of two things is comparable. An incommensura- 
ble involves a "failure of transitivity," where neither of two valuable options is 
better than the other and there could exist some other option that is better than 
one but not the other. [Anderson (1995) and Sunstein (1994) offer slightly dif- 
ferent definitions.] 

The importance of incommensurable categories will vary, partly because 
the significance of this symbolic boundary varies. Their salience depends on 
how passionate we feel about them, on their centrality in defining our roles and 
identities, and on how much effort is required to breach them. Their impor- 
tance also depends, as Simmel would argue, on the relative status of their op- 
positional form, commensuration. The extension of commensuration into 
more spheres of life may make incommensurable categories more meaningful, 
their defense more necessary. This extension may produce paradoxical effects, 
as when "pricing" children in law, labor, and insurance shifted the terms of 
their value from primarily economic to moral and emotional. Children became 
priceless (Zelizer 1985). 

Sometimes trivial things are incommensurable. If I cannot choose between 
chocolate cake and lemon pie, and adding whipped cream to the cake doesn't 
make it better or worse, these desserts are formally incommensurable but 
hardly significant for how I understand myself or how I treat others. Some- 
times incommensurables are expressed for purely strategic reasons, as a bar- 
gaining position. One way to get more leverage or a better price during nego- 
tiations is to assert the incommensurability of something. Dismissing such a 
claim as mere bargaining, in order to discredit it, can also be a political response 
(Espeland 1998). 
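
The dessert example at the start of this paragraph can be restated as a small check of the "failure of transitivity" test attributed to Raz. This is only a sketch under stated assumptions: the relation "better than" is represented as a hypothetical set of ordered pairs, and it reads the example as saying that cake with whipped cream is better than plain cake yet still not better than the pie.

```python
def is_better(a, b, better):
    """True if option a is strictly better than option b."""
    return (a, b) in better


def incommensurable(a, b, options, better):
    """Raz-style test: neither option is better than the other, and some
    third option is better than one of them but not better than the other."""
    if is_better(a, b, better) or is_better(b, a, better):
        return False
    for c in options:
        if c in (a, b):
            continue
        beats_a = is_better(c, a, better)
        beats_b = is_better(c, b, better)
        if beats_a != beats_b:
            return True
    return False


options = ["cake", "pie", "cake_with_cream"]

# Hypothetical preferences: cream improves the cake, but does not settle
# the choice between cake and pie.
better = {("cake_with_cream", "cake")}

# Contrast case: if cake and pie were simply equal in value, anything
# better than one would also be better than the other.
equal_value = {("cake_with_cream", "cake"), ("cake_with_cream", "pie")}

print(incommensurable("cake", "pie", options, better))
print(incommensurable("cake", "pie", options, equal_value))
```

The contrast case shows why the third option matters: it is what separates incommensurability from mere equality of value.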

But incommensurables can be vital expressions of core values, signaling to 
people how they should act toward those things. Identities and crucial roles are 
often defined with incommensurable categories. Believing that something is 
incommensurable can qualify one for some kinds of relationships. When in- 
commensurable categories are important for defining how to "be," Raz calls 
them "constitutive incommensurables" (Raz 1986:345–57). People facing a 
choice involving a constitutive incommensurable will often refuse to partici- 
pate; for some, the idea of such a choice is abhorrent. 

For Yavapai residents whose ancestral land was threatened by a proposed 
dam, land was a constitutive incommensurable (Espeland 1998). The Yavapai 
understood themselves in relation to this specific land. Valuing land as an in- 
commensurable was closely tied to what it means to be Yavapai. The rational 
decision models used by bureaucrats to evaluate the proposed dam required 
that the various components of the decision be made commensurate, including 
the cost and consequences associated with the forced resettlement of the Yava- 
pai community. This way of representing Yavapai interests and expressing the 
value of their land was a contradiction of those values and of Yavapai identity. 

There are many other, common examples of constitutive incommen- 
surables. Two of Raz’s examples are children and friends. Believing that the 
value of children is not comparable to money and that the very idea of ex- 
changing a child for money is repugnant is fundamental to being a good parent. 
The inappropriateness of using commercial means for valuing children is one 
way we define good parenting. Likewise, believing that friendship cannot be 
bought or that what we derive from our friendship with a person is distinctive 
and cannot be had with any other person is basic to what it means to be a good 
friend. Thinking that our friends were somehow interchangeable could keep us 
from having genuine friendships. The pain of selling a childhood home, the re- 
luctance some feel about selling their blood, our disapproval of sex for profit, 
or even faculty qualms over ranking graduate students or evaluating subordi- 
nates compared to “benchmarks” are examples of people grappling with in- 
commensurable categories. Believing in incommensurables is a way to limit 
what can be rationally chosen, and this can be an important social relationship. 

Just as commensuration is a considerable social accomplishment, so too the 
creation of incommensurables requires work. Some party must draw bounda- 
ries around the thing whose value is to be kept, or made, distinctive and then 
defend the boundaries from encroachment. Sometimes these tasks are the pur- 
view of experts: art critics and museum professionals who certify some objects 
as masterworks or as especially worth exhibiting (Becker 1982, Alexander 
1996); attending physicians who invoke clinical wisdom and professional 
privilege to designate some medical cases extraordinary (Bosk 1979). Some- 
times these tasks are the purview of intimate others: the mothers and fathers of 
premature newborns, for example, who are encouraged by hospital staff to 
name their babies, dress them in clothes brought from home, personalize their 
ward cribs with toys and photographs, and otherwise mark their infants as 
unique (Heimer & Staffen 1998). In still other instances the production of in- 
commensurables is the main business of entire organizations, even bureau- 
cratic ones: preservation agencies, for example, that designate official historic 
sites, landmark neighborhoods, and wildlife habitats, as well as the organiza- 
tions that do the grunt work of enforcing the rules. Whether they are priceless 
artworks, national treasures, or precious children, incommensurable things are 
often regarded as somehow sacred, and like all sacred objects, their distinct- 
iveness is defined through symbols and ritual. This marking can be elaborate, 
or mundane: For example, the sequestering of certain cash in a special jar or 
drawer can define it as money for distinctive purposes and thus incommensu- 
rable with other savings (Zelizer 1994). 


STUDYING COMMENSURATION 


Commensuration is a general social process, it is political, and it is capable of 
transforming social relations. It deserves closer, systematic scrutiny. We next 
offer core guiding questions that help reveal variation in how naturalized, how 
fateful, and how resisted commensuration can be. 


How Institutionalized Is the Commensurative Act? 


Instances of commensuration vary by how institutionalized they are, that is, 
they vary in how automatically commensuration gets done and in how natural 
the process seems to involved parties. [This conception of institutionalization 
is indebted to Garfinkel 1967, Berger & Luckmann 1966, and the work of John 
Meyer and his colleagues (e.g. Meyer 1971, Zucker 1977, Jepperson 1991).] 
Attending to institutionalization enables us to appreciate the extent to which 
commensuration constructs what it measures. 

Some instances of commensuration are so deeply institutionalized that they 
help to constitute what they purport to measure. For example, futures traders 
buy and sell agricultural commodities by virtue of standardized grading sys- 
tems that constitute products for entire industries. Grading systems create ex- 
plicit categories of relative quality, and hence relative value, that make possi- 
ble trade in products that may not yet exist (Cronon 1991:97–147; Porter 
1995:45–48). Rankings of academic institutions, which purport to measure 
relative quality according to some common metric of excellence, sometimes 
prompt members to reevaluate their perceptions of their own schools (Elsbach 
& Kramer 1996). As our earlier example suggests, institutions often respond 
directly to raters' criteria; even if members dispute the accuracy or legitimacy 
of rankings, they are too fateful to ignore. 

Some commensurative practices exist only in theory, such as comparable- 
worth wage programs. Intended to improve chronic income disparities be- 
tween women and men, comparable-worth programs commensurate skill and 
pay levels between traditionally female and traditionally male occupations 
(England 1992). But comparable-worth advocates have met with very little 
success in implementing such policies or even in securing judicial approval of 
them (Nelson & Bridges 1998). Such instances of commensuration are weakly 
institutionalized because so few parties use them. Little more than an argument 
(however good a one), this commensuration effectively exists only on paper. 

But what determines the extent to which a commensurative act gets institu- 
tionalized? Phenomenological sociology suggests a preliminary answer. Ber- 
ger & Luckmann argue that socially constructed meaning becomes more fact- 
like when it is objectivated or reified, that is, when social practices are organ- 
ized to sustain the appearance that meaning stands outside of individual sub- 
jectivity, as part of the world (1966:47–92; Berger 1967:3–24). In keeping 
with this insight, we argue that as commensuration gets built into practical or- 
ganizations of labor and resources, it becomes more taken for granted and 
more constitutive of what it measures. Thus, however arbitrary, the Chicago 
Board of Trade's standardized grades of grain quality became ever more con- 
stitutive of what they measured as the number of parties who used the meas- 
ures grew: not only farmers and merchants, but also elevator operators, banks, 
the trade press, and ultimately the state legislature. In time, Chicago business- 
men could make or lose fortunes trading in futures, commodities that exist 
only by virtue of a commensuration system (Cronon 1991:97–147). 

Institutionalization as reification enables us to make predictions about the 
potential trajectory of other commensurative practices. We might expect, for 
example, that college rankings will become more constitutive of what they 
measure as their audiences expand: parents considering where to send their tui- 
tion dollars, faculty plotting careers at prestigious schools, and foundations 
whose grant-giving attends to such measures of institutional quality. On the 
other hand, commensurative acts that fail to get etched into practice, such as 
comparable-worth policies, will remain the purview of academic specialists 
and disappointed reformers. 


How Does Commensuration Refract Power Relations? 


Some proponents see commensuration as a technology of inclusion. This 
makes it especially valuable in democratic, pluralistic societies (Stokey & 
Zeckhauser 1978). Commensuration offers an adaptive, broadly legitimate de- 
vice for conferring a formal parity in an unequal world; for pragmatic reform- 
ers, this is a hopeful beginning (Espeland 1998, Brown 1984). In decisions 
characterized by disparate values, diverse forms of knowledge, and the wish to 
incorporate people's preferences, commensuration offers a rigorous method 
for democratizing decisions and sharing power. 

For supporters, the discipline of commensuration creates robust, “objec- 
tive” knowledge that can constrain power. For example, Marx used the “moral 
statistics” of his day as essential weapons in his indictment of capitalism; We- 
ber (1978:225) saw commensuration facilitating the leveling effects of bureau- 
cratic rationality by providing sturdy mechanisms for challenging old forms of 
privilege; today, discrimination is often fought most effectively with numbers, 
by lawyers girded with statistics; and when standardized tests are used in hir- 
ing decisions, the odds for minorities can improve (Neckerman & Kirschen- 
man 1991). 

Critics of commensuration come from both the right and left. Conservatives 
disdain its equalizing effects, the loss of elite discretion that it fosters. Left- 
leaning critics see commensuration as another conduit of power that mystifies 
power relations, partly by emphasizing results at the expense of process and 
distribution (e.g. Tribe 1971, 1972). Commensuration, in propelling “deci- 
sionism,” helps sustain the pretense that facts and values can be separated, that 
politics can be rendered technical (Habermas 1973:253–82). 

But commensuration is not merely a tool of the powerful, a way to wage in- 
terest politics numerically. Porter (1995) argues that recourse to quantitative 
methods evinces weak authority. The spread of quantitative expertise repre- 
sents a quest for “mechanical objectivity”: knowledge whose authority is 
based on close adherence to quantitative rules. Mechanical objectivity is most 
valued when decision-making is dispersed, when it incorporates diverse 
groups, when powerful outsiders must be accounted to, when decisions are 
public and politicized, and when decision-makers are distrusted. The legiti- 
macy offered by numbers diminishes autonomy, because discretion is replaced 
by disciplined methods. This is why quantitative technologies are the province 
of weak elites and why they are resisted by those whose authority depends on 
expert judgment, character, or informal knowledge. 

Understanding commensuration as a calculus of power requires that we ap- 
preciate the various guises of power, whether these are obvious or opaque, 
strategic or constitutive. While examples of numbers malleable enough to con- 
form to powerful interests are easy to find (e.g. Delaney 1994), commensura- 
tion, once launched, can become hard to control. Strategic commensuration, 
our capacity to create numbers that reflect our will, is perhaps greatest when 
commensuration is less public and less accessible and when methods are new 
or not grounded in academic theory (which creates new partisans). Those who 
think they can manipulate numbers at will are often proved wrong in the long 
run. 

Commensuration's constitutive power is perhaps an even more formidable 
force, altering the people and places where it intrudes. The capacities to create 
new categories and to enforce mechanical objectivity are consequential powers, 
ones often associated with states or firms. Official statistics may be more im- 
portant for the subjects they create (“Hispanics,” “the unemployed,” “gifted 
children”) than for the technical advantage this knowledge confers. Once the 
categories are in place, people's behavior increasingly conforms to them. This 
1s not the obvious power of coercion but the more elusive, passive power of 
discipline, increasingly self-inflicted. The validity of censuses, test scores, or 
public opinion polls requires complicity from their subjects. Individuals are 
made governable (Foucault 1991:87—104) and numbers become self-vindi- 
cating (Porter 1995:45) when measures guide the activities being measured or 
shape the images of those whose characteristics they measure. 

Commensuration produces depersonalized, public forms of knowledge that 
are often deemed superior to private, particularistic forms of knowing (Reddy 
1984). The authority of those who know most about something can be under- 
mined by the rigorous methods of distant, if less informed, officials. For exam- 
ple, before measures were standardized by states, regions and villages often 
had their own distinctive measures. Such heterogeneity in measurement en- 
hanced the salience of local knowledge and facilitated negotiability. A “just 
price” for a unit of grain could be accomplished by peasant strategies for ma- 
nipulating how densely packed it was. This flexibility favored local interests 
over state powers; hence rulers often eagerly imposed new, standardized 
measures (Kula 1986).

Commensuration refracts power in many ways. It can enlarge decision-making
or legitimate preordained decisions. It can be cynically manipulated
by elites or it can limit their discretion. It can create disciplined subjectivities 
or arm dissenters. This variety makes commensuration a useful lens for inves- 
tigating the multiple forms of power. 


When Are Claims about Incommensurables Made? 


Perhaps because of their ability to constitute value and alter power relations, 
some instances of commensuration generate discontent. Claims that some val- 
ues are incommensurable—that they cannot or should not be ordinarily com- 
pared with other values—are not uncommon. Nor are they random. We hy- 
pothesize that the most frequent and most durable claims about incommensur- 
ability occur at the borderlands between institutional spheres, where different 
modes of valuing overlap and conflict. We suspect also that claims about incommensurables
are likely when commensuration threatens some cherished
identity. 

Friedland & Alford (1991:232) define institutions as both supraorganiza- 
tional patterns of activity and symbolic systems through which we give mean- 
ing to activity. Because societies are complexes of multiple institutions, they 
are characterized by multiple modes of valuing. We value monetarily when we 
enter a labor or commodity market; emotionally when espousing friendship or 
love for children or a mate; and bureaucratically when we gauge merit or fault 
by reference to formal rules. These different modes of valuing are not neces- 
sarily consistent with one another. A job that pays well may estrange us from 
loved ones if it requires a move to another city. Meticulous devotion to formal 
rules may make us adequate bureaucrats but horrible friends (Heimer 1992). 

Institutional theorists argue that inconsistency and contradiction between 
institutions can be opportunities for social innovation and change (Orren & 
Skowronek 1994, Clemens 1997) but also sites of deep struggle as different 
modes of behaving, cognizing, and valuing conflict (Friedland & Alford 
1991). We suspect that claims about incommensurables are likely to arise at 
the borderlands between institutions, where what counts as an ideal or normal 
mode of valuing is uncertain, and where proponents of a particular mode are 
entrepreneurial. 

Debate surrounding commercial surrogate motherhood provides a clear ex- 
ample of dispute about incommensurables at the borderlands between institu- 
tions. Sometimes called contract pregnancy, commercial surrogate mother- 
hood is a reproductive arrangement in which, for a fee, a woman agrees to be- 
come pregnant, carry the child, and relinquish her parental rights after delivery 
(see Anderson 1993:168-69). The practice has generated considerable controversy
among feminists and legal scholars (e.g. Moody-Adams 1991, Satz
1992, Radin 1996). For its critics, commercial surrogate motherhood is an encroachment
of market modes of valuing into intimate spheres of life. To combat
the encroachment, some argue that “[w]omen’s labor is not a commodity”
(Anderson 1995:189, original emphasis), nor are the children born of that labor
(Radin 1996:136-53).

What makes commercial surrogate motherhood a locus of claims about in- 
commensurables? The practice exists in a social space where neither intimate 
nor market modes of valuing are hegemonic. As the legal scholar Margaret 
Radin notes, the distinction between the baby-selling of commercial surrogate 
motherhood and the baby-giving of traditional adoption arrangements (in
which adoptive parents often wait for a child of a particular race or age and pay
many costs associated with pregnancy) is a fragile one (Radin 1996:136-53).
In such uncertain terrain we are likely to find vocal advocates for one or an- 
other mode of valuing, and claims about incommensurables can be viable 
weapons in the struggle to control the contested turf. 

Incommensurables will also be claimed where entrepreneurs of one mode 
of valuing wish to move into novel terrain. Radin's careful bid for the market
incommensurability of children (1996), for example, is a direct response to
celebrated arguments for a market in them (Posner 1992:150-54; Landes &
Posner 1978; Becker 1981). 

Claims about incommensurables are also likely when commensuration 
threatens a cherished identity. When commensuration seems to discount some 
component of the self, the short-changed may disavow the implicating mode 
of valuing. Like their forebears in the alternative-school movement (Swidler
1979), many parents who home school their children are suspicious of letter 
grading and formal achievement tests that enable their children's skills to be 
compared quantitatively with those of other children. Deeply protective of the 
individuality of their children, home schoolers fear that standardized perform- 
ance measures at best prevent, at worst erode, a conception of children as 
uniquely gifted persons (Stevens 1996). 

Because collective identities are often defined symbolically, efforts to com- 
mensurate symbolic objects with other valuables can meet with fierce resis- 
tance. Because geographic territory is often deeply symbolic of national iden- 
tity, for example, disputes over territorial sovereignty are often long and bitter. 
The impassioned territorial commitments of Israeli and Palestinian peoples 
have confounded countless efforts to commensurate territorial interests at dip- 
lomatic bargaining tables (Friedland & Hecht 1996). 

That claims about incommensurables are sometimes made by parties who 
may risk loss suggests that such claims may be more strategic than constitu- 
tive. It is tempting to infer that claims of incommensurability are themselves a 
kind of bargaining strategy, akin to bluffing in a poker game to cover a bad 
hand or to up an opponent's ante. Surely some claims of incommensurability 
are strategic in this way. But claims about incommensurables may also simultaneously
reflect deeply held convictions and clever bet-hedging. People who
lose their community to an industrial disaster may find the symbolic void irre- 
placeable but will also use that loss as grounds for material compensation 
(Erikson 1976). Disentangling the constitutive from the strategic in claims 
about incommensurables requires careful empirical work and recognition that 
people often have multiple and even contradictory incentives. 


COMMENSURATION IN ACTION 


We believe that attention to commensuration provides novel insights into es- 
tablished fields of sociological inquiry. To conclude, we illustrate how such an 
analytic focus might inform work in three broad substantive areas: gender and 
work, politics and social movements, and institutional sociology. Our goal is 
to sketch the potential utility of this way of theorizing in order to encourage 
further and more systematic efforts. 


Feminist Commensuration and Its Discontents 


That commensuration has the potential to transform what it measures is dem- 
onstrated in the repeated efforts of feminists to value household work in met- 
rics used to quantify paid labor. Nineteenth-century reformers (Siegel 1994), 
twentieth-century feminists, and social scientists have sought to re-value the 
typically unpaid cooking, cleaning, child-rearing, and household management 
tasks women do for their families in metrics of time and output. The goals of 
such efforts have been multiple: to implicate housework in broader critiques of 
capitalism (Luxton 1980, Hartmann 1981); to quantify unequal distributions 
of domestic work between men and women (e.g. Walker & Woods 1976, 
Hochschild 1989); to argue for paid housework (Oakley 1976:226); and to em- 
phasize how much of housework is a low-status chore (Mainardi 1970). How- 
ever, some women have been reluctant to commensurate their own home work 
with paid labor, which suggests both a symbolic boundary around domestic re- 
lationships and a fissure between feminist and “pro-family” women that the 
analytic lens of commensuration can help to define. 

Central to “modern” conceptions of family (see Stacey 1990:3-19) is the 
belief that family relations are of a fundamentally different character than 
those of the marketplace: Families are havens partly because relations among 
family members are governed by something more than self-interested individ- 
ual calculation (Lasch 1977). If families are partly defined by their nonmarket 
exchanges, then attempts to commensurate these exchanges with labor market 
transactions may undermine the distinctiveness of familial relations. If house- 
hold work is made formally commensurate with other forms of paid labor, then 
families appear more like the nodes of resource agglomeration, consumption, 
and social reproduction that some economic theory imagines (Becker 1981)
and less like the havens envisioned by “pro-family” movements both historical
and contemporary (e.g. Ryan 1981, Martin 1996). 

The transformative potential of commensurating housework with other 
kinds of labor is double-edged. Feminists who have advocated direct comparisons
have done so in order to alter women's relationships to other family members
and to the broader labor force. By encouraging housewives to think of
themselves as workers, laboring under oppressive conditions, early liberal and
radical feminists sought to change women's appraisals of their household
situations in ways that would incite them to domestic activism: At least, men
would do more and women less at home than the modern-traditional rules prescribed
(Hole & Levine 1971:85); at most, more equitable allocations of
housework and childrearing would allow radical new models of family (Firestone
1970). The commensuration of housework with other kinds of labor has
helped feminists to argue convincingly that gender asymmetries in the division 
of domestic work unjustly constrain women's lives. 

Other women have resisted workplace modes of valuing at home. Opponents
of the Equal Rights Amendment (ERA) resisted the legislation partly because
of how its advocates conceived of housework. The domestic arrangements
that feminists found so impoverishing were a way of life for millions of
women. Commensurating housewifery with other occupations and declaring it 
wanting further undermined the already eroding status of angry homemakers 
(Mansbridge 1986:90-117). Within the abortion controversy, many pro-life
activists object to the commensuration of motherhood with paid-work occupations.
Many full-time homemakers believe that workplace logic diminishes
them, that their lives measure poorly on metrics of income, occupational attainment,
and personal autonomy (Luker 1984:158-215).

By commensurating housework with paid labor, feminists sought to trans- 
form both social appraisals and the social organization of domestic work. By 
most accounts, however, their efforts are only weakly institutionalized. Feminists
have succeeded in demonstrating the low status of housework and in altering
the life expectations of many women. But as Hochschild (1989:12)
states succinctly, "There has been a real change in women without much 
change in anything else": Men contribute only minimally more to household 
duties, workplaces only reluctantly accommodate employees' family demands,
and childcare remains a domestic, not a public or corporate, obligation.
And as the ERA and abortion battles make clear, feminist efforts to commensurate
domesticity have generated concerted arguments about incommensurables.
Women invested in domesticity have found some of the feminist
equations deep threats to their identities.

5. In childcare, commensuration has also directed attention to other sorts of distinctions: quality
vs quantity time; individualized attention vs group socialization; daycare or preschool, a nearby
relative, a certified caregiver, or an imported au pair. With housework less emotionally loaded than
childcare, fewer distinctions seem necessary. Nevertheless, commensuration remains contested in
both arenas.


Politics and Social Movements 


Commensuration makes possible modern politics. Opinion polls, in eliciting 
and organizing attitudes, create the object we call public opinion (Herbst 
1998). Politics, as the art of compromise, is a broad instance of commensura- 
tion. Political negotiation entails seeing one’s own interests as comparable to 
the interests of others. Our conception of interests as a basic unit of political 
analysis implies commensuration. When political disputes are framed as a 
contest over interests, parties are granted a formal, categorical equality among 
those with a political stake. Interest-group politics portrays outcomes as if differences
were a matter of magnitude (of how much something matters, or of
whose interests were served) rather than as disparate modes of investment in
the decision. Voting is one way to commensurate interests. Trading—of cam- 
paign dollars for a sympathetic ear in office, of my vote on your project for 
yours on mine (the essence of pork-barrel politics), of tit for tat at the bargain- 
ing tables where multiple interest groups attempt to forge mutually advanta- 
geous coalitions—requires that traders evaluate diverse interests along some 
shared order of magnitude. Such commensurative acts are at the heart of nor- 
mal politics, explaining puzzles such as why we have so many dams (Reisner 
1986) or why tax reform requires sports stadiums (Birnbaum & Murray 1987). 
Making qualitatively unlike interests comparable can be a formidable cogni- 
tive achievement; that politicians, campaign contributors, and rank-and-file 
voters do such commensurating all the time is testament to the extent to which 
the equation of diverse values is commonplace in modern life. (Of course, the 
mode of commensuration matters here: Trading votes is regarded as accept- 
able political behavior; buying and selling them is not.) 

But as many social-movement activists discovered, commensurative poli- 
tics brings its own quandaries. Many New Left student activists of the 1960s 
and 1970s avoided participation in normal party and electoral politics because
they believed that the structures of those institutions were morally flawed. 
Some New Left activists equated negotiation and trading of interests with 
moral compromise. For them, conventional political activism was suspect pre- 
cisely because it required trade-offs among inviolable interests and illegiti- 
mate ones. The New Left’s “great refusal” to participate in the commensura- 
tive art of normal politics has been cited both as its greatest moral accomplish- 
ment (Breines 1989) and as a cause of its ultimate political weakness (e.g. Git- 
lin 1987). 

The recent history of the Religious Right indicates just how consequential 
choices to commensurate interests can be. Although a few conservative Christian
leaders have long advocated translating the faith into political activism
(Ribuffo 1983), only relatively recently did large numbers of rank-and-file be- 
lievers begin to conceive of themselves as distinctive players in the realm of 
normal politics (Himmelstein 1990). A great accomplishment of conservative 
Christian leaders since the 1970s has been convincing many rank-and-file 
Christians to enact their faith in the political arena: Doing so has obliged be- 
lievers to consider the comparability of their faith-based interests with the 
more secular agendas of other conservative factions (e.g. Klatch 1987, Rozell 
& Wilcox 1996). Typically cultural-traditionalists, conservative Christians co- 
operated with libertarian and economic conservatives to win three consecutive 
Republican presidencies (e.g. Himmelstein 1990, Martin 1996). But the be- 
lievers were dismayed when their unequivocal stances on abortion, school 
prayer, and homosexuality became compromisable interests at Washington 
bargaining tables (Diamond 1995, Martin 1996). The powerful Christian Coa- 
lition has recently confronted a difficult choice: holding close to policy posi- 
tions dear to conservative Christians or becoming more flexible in its stances 
on abortion, homosexuality, and other divisive issues in order to cooperate 
with other interest groups and a wider array of politicians (e.g. Reed 1993). 
Throughout its recent past, then, the Religious Right has wrestled with 
whether, and how, to commensurate its faith-based commitments with the 
secular parties and profane interests it encounters in the broader political 
arena. 

That some movement activists from left to right have been wary of the com- 
promises normal politics requires suggests their awareness of the transforma- 
tive potential inherent in commensurating disparate values. When we opt to 
negotiate with parties who do not share our vision of the world (e.g. members 
of the “Establishment,” those not born again), we risk alienation of our interests.
Negotiation requires commensurating with the enemy: It requires comparing
the cherished with the reprehensible in ways that make the former less
distinctive, less incomparably valuable than it once was. Not surprisingly,
movements that stake their identities on incommensurables (radical democracy,
heavenly truths, and native lands, for example) face a dilemma even
coming to the bargaining table. 

(Of course, sometimes social movements embrace commensuration as a legitimating
device. For women's reform organizations during the Progressive
Era, the substitution of money for personal service was a way for women to
signal that theirs were serious, modern organizations. As Clemens (1997:209-10)
argues, these women understood that citizenship required cash.)

6. Just as commensuration creates new social relations, so too does creating incommensurables.
Not all incommensurables carry the same cultural weight, but some things defined as
incommensurable may be subject to distinct rules of conduct. For example, family heirlooms
bestowed on particular persons are often subject to special uses and, except under extraordinary
conditions, are removed from markets.


Institutional Sociology 


The ability of commensuration to create new social relations and even new so- 
cial entities is clear in recent work by institutional sociologists. Studies of the 
elaboration and worldwide diffusion of census activity (Ventresca 1995) and 
of formal accounting procedures (Meyer 1986, Miller & O'Leary 1987, Car- 
ruthers & Espeland 1991) show how particular measuring, recording, and 
ranking processes help to make and remake phenomena they ostensibly de- 
scribe. 

Ventresca (1995) argues that the worldwide diffusion of relatively stan- 
dardized modes of census administration helped render different parts of the 
world formally comparable. Shared counting procedures help shape how dif- 
ferent populations make sense of one another and of themselves. With similar 
censuses, societies with wildly disparate histories, cultures, and economic and 
political structures are made to seem easily comparable. Vital statistics on 
scores of nations can be aggregated, summarized, and ordinally ranked—on a 
single page—facilitating charitable, diplomatic, and market linkages across 
vast stretches of social and geographic space. 

Studies of accountancy offer parallel pictures of commensuration practices 
that make qualitative unlikes quantitatively comparable. Standardized ac- 
counting procedures make a firm's varied assets and liabilities, from raw mate- 
rials to workers, uniformly calculable in monetary terms (Miller & O'Leary 
1987) so as to produce values like “net worth.” Like census figures, net worth
is easily compared across firms (Carruthers & Espeland 1991). Such comparability
permits us to understand firms as financial portfolios rather than as productive
units. With accountants busy creating comparable bottom lines, executives
can buy and sell firms while focusing on their profitability rather than on
what they produce (Espeland & Hirsch 1990, Fligstein 1991). 

But commensuration does more than produce new relations. It can also pro- 
duce new entities. Common to these quite different studies of censuses and ac- 
counting procedures is the notion, informed in part by the work of Foucault 
(1973, 1977, 1978), that preponderant administrative practices create what 
they purport to describe. For example, Ventresca argues that modern census 
procedures help to create the nation-states they quantify. Censuses define the 
boundaries of state sovereignty by specifying just who is within those bounda- 
ries and who is not. The very structure of a census as an official count of per- 
sons assumes an aggregate relationship between nation and individual—the 
nation-state is the individuals it counts. Censuses also reify these individuals,
marking them as non-, quasi-, or full citizens of a particular state and lending
broad cultural salience to those facets of individual identity about which census
counters, and their questionnaires, query (Ventresca 1995). In rendering
nation-states more comparable, censuses also constitute what they compare
(Desrosières 1990).

Conceptually similar processes characterize the rise of formal accounting 
procedures. Accountants, promising information that will improve efficiency, 
have historically sought ever more elaborate means of measuring labor output 
and labor costs; such measurements enable designation of modal and optimal 
levels of productivity against which many workers can be ranked and com- 
pared. Accounts thus help to construct such organizationally consequential be- 
ings as the average worker, the ideal worker, and the suboptimal worker 
(Miller & O'Leary 1987). 

Social critics from Simmel to Foucault have sought to portray how mod- 
ernization reconstitutes human subjectivity and transforms long-established 
social relations. Examining particular instances of commensuration may en- 
able institutional scholars to better discern the mechanics of those changes. 
Recent theoretical work underscores this potential. Neoinstitutionalists operationalize
modernity as a “Western cultural account,” global in scope, that
among other things assumes the calculability of all social values. In that mod- 
ern story, human progress is incremental: Only by measuring can individuals 
or nation-states know how they are faring in personal or global history (Meyer 
et al 1994). Acts of commensuration facilitate comparative measurement 
across vast differences of sentiment, person, kind, culture, and nation. Ration- 
alist, imperialist, and at times transformative, they may be key ways that we 
make ourselves modern.


ABSTRACT


The recent cultural turn in American sociology has inspired a number of 
more scientifically oriented scholars to study the meanings that are embedded
within institutions, practices, and cultural artifacts. I focus here on research
that (a) emphasizes institutional (rather than individual) meanings,
(b) uses a structural approach to interpretation, and (c) employs formal algorithms
or quantitative procedures for reducing the complexity of meanings to
simpler structural principles. I discuss two core methodological issues: the
assessment of similarities and differences between items in a cultural system 
and the process by which structure-preserving simplifications are found in 
the data. I also highlight the importance of two-mode analytic procedures 
and I review some of the perceived benefits and criticisms of this style of re- 
search. 


INTRODUCTION 


The cultural turn that has recently swept through much of American sociology 
has meant that sociologists are ever more frequently focusing on the role of 
symbols, meanings, texts, cultural frames, and cognitive schemas in their theo- 
rizations of social processes and institutions. Although this resurgence of in- 
terest in cultural phenomena is often associated with the shift towards more 
humanistic and interpretative methodologies, an increasing number of quanti- 
tatively oriented scholars have also begun to turn their attention to the study of 
cultural meanings. In the process a new body of research has begun to emerge 
in which social practices, classificatory distinctions, and cultural artifacts of 
various sorts are being formally analyzed in order to reveal underlying structures
of meaning.

This work is scattered across many substantive areas of research. It has no
coherent center or easily definable boundaries. There is, as yet, no clearly ar- 
ticulated core methodology or statement of theoretical intent. As a conse- 
quence, I adopt a somewhat idiosyncratic set of inclusion rules for the material 
covered in this review. My focus emphasizes studies that measure institutional 
(or cultural) rather than individual meanings. I pay special attention to those 
projects that have (if only implicitly) adopted some variant of a structuralist
approach to interpretation. And I devote particular attention to describing the 
ways in which formal analytic methodologies are being employed to reduce 
complex collections of cultural data to simpler, more easily intelligible struc- 
tures of meaning. 

I begin with a short discussion of how the measurement of meaning devel- 
oped in American sociology up to and including recent attempts to study insti- 
tutional processes through the formal analysis of meaning structures. I then 
highlight the main elements of a structural approach to meaning and describe 
two methodological issues that are involved in conducting this type of re- 
search—the measurement of similarities and differences, and the reduction of 
complexity through various types of formal analyses. I focus next on one
class of analytical methods that seems to me to hold out the
greatest promise for future progress. I end with a brief discussion of some of
the advantages and disadvantages of formal approaches to interpreting meanings.


MEANING AND MEASUREMENT IN CONTEXT 


Formal analysis of meaning structures is not new to the social sciences. Much 
important research has been done by scholars in other disciplines, including 
the pioneering work on semantic differential techniques by the psychologist 
Charles Osgood and his colleagues (Snider & Osgood 1969). Cognitive psy- 
chologists have carried this trajectory forward in a myriad of ways. Applica- 
tions and extensions of this work have been pursued by linguists, political sci- 
entists, market researchers, and anthropologists. The latter group in particular, 
and especially cognitive anthropologists such as Roy D’Andrade (D’Andrade
1995), have been instrumental in developing the theory and method
of meaning analysis as a formal endeavor. 

Formal analysis of meaning has also had a long-standing home in at least
two areas of American sociology. Meaning measurement has been a central 
concern of sociologists who use survey methods to study opinions, attitudes, 
and beliefs (Sudman et al 1996). Content analysis of meanings in textual data 
is a second area that has been systematically developed since the early work of 
Bernard Berelson (Berelson 1952). However, with the exception of a few subfields
such as political sociology, where opinion research continues to be important,
both of these methodological projects have become isolated from the
core research traditions of sociological work. This is partly because the data 
sources (opinion surveys and coded texts) have been difficult to acquire and 
are generally limited to specialized areas of investigation. Many quantitative 
sociologists also seem to believe that the dynamics of social structure (formal 
organizations, social movements, processes of social mobility and status at- 
tainment, and the like) can be measured, but that the more ephemeral aspects 
of cultural meanings cannot. Thus, the meaningful character of social action 
(and of institutional life more generally) has appeared as part of the broader 
theoretical context which frames these research programs rather than as some- 
thing that is measured directly. 

Recent work has begun to bridge the divide between culture and social 
structure. The flourishing of symbolic interactionism and ethnomethodology 
during the sixties contributed to an enduring and widespread appreciation of 
the socially constructed character of the social world (Berger & Luckmann
1967). Subsequent theoretical statements almost universally emphasized the 
dialectical or dualistic relationship between cultural meanings and social 
structures (Bauman 1973, Sahlins 1976, Bourdieu 1977, Giddens 1984, 
Swidler 1986, Sewell 1992). Slowly, these insights have come to be incorpo- 
rated into the empirical projects that characterize core regions of the discipline. 
Culture was first linked to the logic of the production process in popular cul- 
ture industries (Peterson & Berger 1975, Peterson 1976). It was shown to be an 
important and measurable factor in the prediction of status attainment out- 
comes (DiMaggio 1982, DiMaggio & Mohr 1985, Mohr & DiMaggio 1995). 
Institutionalists demonstrated the significance of symbols and cultural pro- 
cesses in the study of organizations and their environments (Meyer & Rowan 
1977, DiMaggio & Powell 1991). Social movement theorists showed the im- 
portance of cognitive frames (Snow et al 1986, Eyerman & Jamison 1991) and 
cultural processes of identity formation (Darnovsky et al 1995). Many other 
examples could be pointed to. 

However, the study of cultural phenomena is not the same as the analysis of 
meaning. As Wendy Griswold has persistently complained, most of the re- 
search continues to sidestep the problem of meaning analysis altogether. The 
distribution of genres has been mapped, elements of cultural production have 
been counted, levels of cultural knowledge have been measured, increases of 
organizational homogeneity have been demonstrated, but the meanings that 
are constitutive of these cultural phenomena have largely been left aside. Gris- 
wold has sought to develop methodological approaches that incorporate soci- 
ology’s empirical rigor and sophisticated understanding of social structure 
while also taking the meanings embedded within literary and other cultural 
texts as a critical element of the analysis (Griswold 1987a,b, 1992, 1993). 
While she has been instrumental in focusing attention on this problem,
Griswold's own work has largely been directed toward the study of literature and,
although she is rigorously empirical, the formal measurement of meaning has 
not been her goal. 


ACTORS, ACTIONS, AND OBJECTS OF ACTION: A 
NEWER INSTITUTIONALISM 


In this essay, I review many examples of sociological work that seeks to di- 
rectly measure cultural meanings. My focus will be largely methodological in 
that I will organize the discussion around the question of how this type of re- 
search is conducted. But I begin with three recent examples that demonstrate 
the potential contributions of meaning measurement for the empirical investi- 
gation of core sociological questions. What sets this work apart is the folding 
together of cultural meanings and social structures as primary elements within 
the same research design. Such an approach much more faithfully reflects 
theorists' contention that social structures and cultural structures are mutually 
constitutive. Thus, in the following examples, meanings are measured in order 
to show how social structures are created. All three examples concern historical
changes in institutional forms, a reflection of the fact that historical sociology
has been an especially fertile terrain for this style of research because
researchers are constrained, by necessity, to analyzing meanings embedded
within texts (Franzosi & Mohr 1997). 

Charles Tilly has been a long-standing leader in this style of research. His 
recent work is especially instructive. Tilly (1997) seeks to explain the parlia- 
mentarization of British politics, that is, how the structure of political influ- 
ence shifted from the local to the national level while the nature of collective 
claim-making was transformed from a reliance on spontaneous acts of (often 
violent) protest toward the more formalized grievance mechanisms of political 
parties and organized social movements. Though this might be viewed as a 
problem of organizational change or of the formal transformation of political 
structure, Tilly sees this as an occasion for cultural analysis. He emphasizes 
the need to understand how the popular meaning of politics changed during 
these years, and especially how conceptions of social rights shifted along with 
ideas about how to make a collective claim in defense of one's rights. His 
analysis relies on textual data, especially verbatim summaries of contempo- 
rary newspaper articles reporting on some 8,000 “contentious gatherings" that 
occurred between 1758 and 1834. 

Tilly argues that shifting relations between claim-makers and claimants define
the character of political institutions. For his analysis, Tilly divides
12,000 or so different contentious groups into 64 social categories (farmers, 
friendly societies, workers, masters, constables, militia, parishioners,
gentlemen, local officials, and so on). He then looks for claim-making relationships
among these groups by analyzing his data on contentious gatherings to see
what actors made what kinds of claims (attacking, donkeying, petitioning, ar- 
resting, applauding, addressing, and other similar efforts) against what other 
actors. A blockmodel analysis, divided by time periods, shows how the mean- 
ing and structure of political action changed dramatically during these years. 

DiMaggio & Mullen (1993) provide a second example. Their study con- 
cerns how American communities went about celebrating National Music 
Week in 1924. Like Tilly, DiMaggio & Mullen are interested in the way power 
and influence are organized and, also like Tilly, they see the social structure of 
community politics as being built up out of shifting systems of meaning. The 
events studied here are not contentious gatherings, however, but formally or- 
ganized community rituals. They collected data on 833 events occurring in 419 
different communities, focusing on the actors, actions, and objects of action in 
each event. In this case, the actors were the event participants (clubs, churches, 
ethnic associations, employee groups, and the like). The catalogue of actions 
included the various types of musical events that were planned—those that 
tended to reinforce status group boundaries (e.g. religious or classical music 
concerts) and those that had a more inclusive focus (band and patriotic music, 
group sings, and so forth). The objects of action were the types of audiences 
that were encouraged to participate: Were they general admission events or 
events that were staged in such a way as to assemble people who were defined 
as members of specific associations, congregations, or as residents of particu- 
lar institutions? 

Once again, it is the relations between these three sets of elements—partici- 
pants, performances, and audiences—that DiMaggio & Mullen use to con- 
struct a measure of the shifting institutional logics of community political 
structure. What they discover are four relatively distinct modes of organizing 
community rituals that they then link to the foundations of political authority 
in each community. They identify these as (a) “rituals of ratification" that tend 
to reaffirm traditional arrangements of the social order; (b) “rituals of communitas"
that constitute community members as individuals who are unmediated
by private associations but, nonetheless, members of a collective unity; (c) 
“rituals of civic unity" that constitute people as individual consumers and par- 
ticipants in a local economy that all have a vested interest in supporting; and 
(d) "rituals of incorporation" that identify community members as workers, 
members of ethnic groups, and other diverse collectivities that must be some- 
how integrated into a whole. 

The third example is a study by Mohr & Guerra-Pearson (1998) of commu- 
nity social welfare agencies in New York City during the Progressive Era. 
Here the question is how particular kinds of organizational forms come to be 
institutionalized. Like much work in the organizational ecology tradition,
Mohr & Guerra-Pearson focus on the ways in which different types of organizations
compete for resource niches by making jurisdictional claims over spe-
cific regions of institutional space. Unlike work in this tradition, however, the 
focus is explicitly shifted to the interpretations that organizational actors pro- 
pose about the meaning of institutional activities. As before, three sorts of 
foundational elements are studied. Using written descriptions of organiza- 
tional activities for some 600 organizations, Mohr & Guerra-Pearson collected 
information at four different times about status categories that were used to de- 
scribe relief recipients (men, women, boys, girls, children, sailors, travelers, 
the working, able-bodied, and so on); classes of social problems (criminality, 
delinquency, disability, immorality, and the like); and technologies of organ- 
izational action (such as general relief, employment assistance, vocational 
training, and character-building). 

In this study, it is the structure of the organizational environment that is 
mapped. Relations between organizations are measured according to the simi- 
larities of their claims about the region of institutional space in which they seek 
to operate. Similarity among claims is defined by the extent to which organiza- 
tions apply the same technologies to the same status identities, afflicted by the 
same types of social problems. Differences of interpretation arise when alter- 
native combinations are invoked (as when different technologies are purported 
to be effective for treating the same classes of problems). Multidimensional 
scaling enables the organizations' niche locations to be mapped out on the ba- 
sis of the claims that were made about how poverty problems should be inter- 
preted and addressed. Using this approach, Mohr & Guerra-Pearson demon- 
strate how settlement houses waged a (losing) battle with social work bureauc- 
racies and other more rationalized organizational forms in their bid to become 
the primary institution for delivering community social welfare services dur- 
ing these years. 

I have begun with these examples for several reasons. First, all three em- 
ploy structural methods for measuring meanings. In Tilly's case, the meanings 
concern collective actors' sense of public rights and their conceptions about 
the appropriate means for securing those rights. DiMaggio & Mullen study the 
meanings of community and sets of emergent norms about how to symboli- 
cally consecrate the social order. Mohr & Guerra-Pearson analyze organiza- 
tional interpretations of community social problems and contesting claims 
about appropriate solutions to those problems. In each case, the relevant sys- 
tems of meanings are studied empirically through an analysis of the relation- 
ships that actors impose on various primitive institutional elements (actors, ac- 
tions and objects of action). Finally, all of these studies make explicit linkages 
between the systems of meanings that are investigated and the social structures 
in which they are embedded. Indeed, the point of each project is to demonstrate 
the ways in which enduring social institutions are explicitly constructed out of 
a complex process of negotiation and contestation over cultural meanings.


THE STRUCTURAL APPROACH TO INTERPRETATION


The key methodological maneuver in these projects can be traced to the struc- 
tural method of interpretation, generally associated with the semiotic, structural- 
ist, and poststructuralist intellectual projects that thrived in Europe beginning 
in the 1960s. Although diverse in both methods and goals, most of this work is 
either in the tradition of or in response to the work of the French linguist, Ferdi- 
nand de Saussure [(1916) 1959]. Structuralism (in this Saussurian tradition) 
was founded on the argument that meaning was constituted through the systematic
distinctions that differentiate words (or sounds, or signs) from one another.
A number of analytic principles follow from this. Content is seen as being
fundamentally arbitrary. Patterns of differences within a broader system of
cultural objects become the focus of analysis. The interpretation of meaning is 
seen to be connected to the analysis of the system of relations that link cultural 
objects. As developed most famously in the writings of the anthropologist 
Claude Lévi-Strauss (1963), much of the work of interpretation comes to be directed
toward the identification of underlying (deep) structural principles that
serve to organize the larger complexity of relational patterns. 

There is an enormous literature that discusses, evaluates, and critiques these 
intellectual projects. Useful introductions are provided by Hawkes (1977), 
Pettit (1977), Kurzweil (1980), Wuthnow (1987), and Caws (1988). Moreover, 
as the discussions in Caws (1988), D'Andrade (1995), and Emirbayer (1998)
make clear, the developments of the Saussurian structural tradition paralleled a
broad variety of other intellectual movements that were also shifting toward a
relational mode of analysis during the same period. The research that I review
here is (for the most part) only loosely coupled to these literatures and I will not 
spend the time to develop these connections in any detail. I will simply point 
out that most of the work that I discuss in this review adopts (if only implicitly) 
some type of structural approach to the problem of interpreting meaning. 

As an example of this mode of analysis, consider Karen Cerulo's (1988, 
1995) research on national anthems. Cerulo shows that national anthems can 
be analyzed as cultural meaning systems and that the structures that are identi- 
fied in the process can be used for comparative analysis. She accomplishes this 
by treating music as a series of symbolic codes: “melodic codes, phrase codes, 
harmonic codes, form codes, dynamic codes, rhythmic codes, and orchestral
codes." Musical notes are the elements that, when combined, generate these 
code systems. Hence, it is the pattern of relations between the notes that is
relevant; the musical notes themselves lack any specific intrinsic value. Rather, they
"derive their meaning according to their placement within a larger system" 
(1988, p. 319). Through a complex and ingenious series of measurements devoted
to assessing the relational patterns of the notes, Cerulo maps out the musical
structure of over 150 national anthems. The “meaning” of the music is
thus represented by these structural mappings which, as Cerulo demonstrates,
closely correspond to the sorts of genre distinctions (e.g. between a march and 
a hymn) that musicologists employ to differentiate these styles of music. See 
Timothy Dowd (1992) for a similar approach to analyzing American popular 
songs. 

Cerulo (1995) proceeds to show that several basic social processes very ef- 
fectively predict the type of musical composition that a specific country is 
likely to adopt. In particular, she shows that position within the world political 
economic system, relationship to neighboring countries, to a specific colonial 
power, and to a cohort of other nations, all influence the character of the musi- 
cal composition that is chosen for a national anthem. She uses a similar type of 
analysis to map out the basic structural characteristics of national flags (by 
looking at how color, pattern, and images are arrayed relationally within the 
flag). Her analysis demonstrates that the same factors influenced the selection 
of these sorts of national symbols as well. 

Cerulo's work illustrates the core principles of the type of structural analy- 
sis that is the focus of this review: (a) basic elements within a cultural system 
are identified, (b) the pattern of relations between these elements is recorded,
(c) a structural organization is identified by applying a pattern-preserving set
of reductive principles to the system of relations, and (d) the resulting structure 
(which now can be used as a representation for the meaning embedded in the 
cultural system) is reconnected to the institutional context that is being investi- 
gated. To begin such an analysis, three problems need to be solved. First, basic 
elements of the cultural system have to be identified. Second, a relevant sys- 
tem of relations must be identified. Third, the pattern of relations must be re- 
corded. These are not trivial tasks. 

Initial decisions regarding what will count as the cultural elements whose
relational arrangement will be analyzed are critical to determining the significance
of the meaning structures that can be uncovered. Because Cerulo was interested
in identifying aesthetic structures, she turned to expert literatures in
graphic design and music theory for guidance in selecting relevant elements 
for inclusion. This illustration is generalizable. Every structural analysis must 
begin with the identification of a relevant set of cultural items, and these items 
are never simply available in an immediate fashion. An informed cultural or 
institutional theory is necessary to enable us to notice and to be able to make 
relevant distinctions between the constituent elements. 

The same cautionary note applies to the selection of relational measures. 
Elements of a cultural system can be similar or different in any number of 
ways; the trick is to identify those relations that matter. Some of the most im- 
portant criticisms of the French structuralists were that they frequently got lost 
in their own overly formalistic models of structure. This is what Philip Pettit
described as the dangers of mere "pattern-picking" (Pettit 1977, p. 41). When
one reads, for example, Clifford Geertz's (1973) criticisms of Lévi-Strauss, it
is hard not to be persuaded by his complaints that the structuralists were wrong 
to focus solely on the cultural phenomena in isolation from what we might de- 
scribe as the institutional contexts of culture. This was manifested in their ten- 
dency to treat cultural codes as closed systems. For the structuralists, the logic 
of culture was all too often presumed to lie within the cultural system itself (or, 
as in the case of Lévi-Strauss, within some intrinsic ordering properties of the 
human mind). 

There is a solution to this dilemma, however; relations of elements within a 
cultural system should be determined on the basis of how the elements are 
linked to the practical demands of the institutional system of which they are a 
part. This is an argument that goes under the general heading of practice the- 
ory. Classic statements of this perspective include Bourdieu (1977), Geertz 
(1973), and Giddens (1984). Useful summaries and commentaries can be 
found in Ortner (1994) and Friedland & Alford (1991). The argument is that
any cultural system is structured as an embodiment of the range of activities, 
social conflicts, and moral dilemmas that individuals are compelled to engage 
with as they go about negotiating the sorts of everyday events that confront 
them in their lives. This insight has direct implications for the measuring of 
meaning structures. It suggests that when we think about identifying a set of 
cultural items, asking how they are related to one another, and assessing the 
type of structural model that might be relevant, it is important to begin with the 
question of what type of practical utility such a cultural system plays within a 
concrete institutional setting. Ideally, relations between the cultural elements 
should be assessed by looking at how actors, organizations, or institutions 
make practical use of the cultural distinctions being investigated. 


MEASURING RELATIONS 


Once the relevant elements and relational contexts of a cultural system have 
been identified, measurement can begin. The first task is to compare each ele- 
ment in the cultural system with every other element in terms of the identified 
relationship. Four types of measurement strategies can be distinguished. Simi- 
larities and differences among cultural items can be assessed according to (a) 
subjective judgements, (b) common attributes, (c) relations to others, and (d) 
structural-functional profiles. 


Subjective Similarity


Two cultural items can be said to be similar to one another to the extent that
individuals make a cognitive judgement that they are of “the same sort." Because
they often are faced with the need of collecting data in different language
situations or from nonliterate respondents, cognitive anthropologists have
been especially innovative in devising a great many alternative methods for
collecting these kinds of data (see Hays 1976, Weller & Romney 1988, D'An- 
drade 1995). For example, respondents are asked to judge the similarity of pairs of 
cultural items directly, or they are asked to select which two out of three items 
are most similar to one another. A third method is to ask the respondent to sort 
cards referring to cultural items into piles of things that "go together." Scores can 
then be aggregated across a number of respondents to derive a set of average 
measures of inter-item similarities. Richard Shweder used these techniques to 
investigate how individuals in different cultures understand the concept of the 
person (Shweder & Bourne 1991). For example, he used pile sort techniques to 
assess how individuals in India organize their understanding of 81 different 
personality traits. Included here were terms translated as “crooked,” “lazy,” 
“obstinate,” “brutally frank,” “loyal,” “harmless,” “fickle,” “contemplative,” 
and so on. The resulting 81 x 81 matrix was used by Shweder & Bourne to 
demonstrate that the respondents’ cultural understanding of these terms was 
structured around generalized dimensions of power and social desirability. 
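
The aggregation step described above can be sketched in a few lines of Python. This is only an illustration: the function name pile_sort_similarity, the three respondents, and their piles are hypothetical and are not taken from Shweder & Bourne's data. For every pair of items, the sketch records the share of respondents who placed both items in the same pile.

```python
from itertools import combinations


def pile_sort_similarity(sorts, items):
    """Share of respondents who put each pair of items in the same pile."""
    index = {item: k for k, item in enumerate(items)}
    n = len(items)
    counts = [[0] * n for _ in range(n)]
    for piles in sorts:                  # one respondent's complete sort
        for pile in piles:               # one pile of things that "go together"
            for a, b in combinations(pile, 2):
                i, j = index[a], index[b]
                counts[i][j] += 1
                counts[j][i] += 1
    total = len(sorts)
    sim = [[counts[i][j] / total for j in range(n)] for i in range(n)]
    for i in range(n):
        sim[i][i] = 1.0                  # an item always matches itself
    return sim


# Hypothetical data: three respondents sorting four trait terms.
items = ["lazy", "fickle", "loyal", "harmless"]
sorts = [
    [["lazy", "fickle"], ["loyal", "harmless"]],
    [["lazy", "fickle", "harmless"], ["loyal"]],
    [["lazy"], ["fickle"], ["loyal", "harmless"]],
]
sim = pile_sort_similarity(sorts, items)
```

The resulting square, symmetric matrix has the same form as the 81 x 81 matrix mentioned above and can be passed to a scaling or clustering procedure.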


Attribute Similarity 


Items within a cultural system can also be compared on the basis of the sets of 
selected attributes that they share. This type of technique can also be used in 
collecting interview data. Csikszentmihalyi & Rochberg-Halton (1981) em- 
ployed this strategy in their study of the meaning of household possessions 
among Chicago families. A list of items was first elicited by asking respon- 
dents “What are the things in your home that are special for you?” Interviewers 
then asked about the meanings that were attached to each of these items. This 
resulted in a listing of 1694 objects (collapsed for analysis into 41 types). The 
respondent’s meaningful associations were classified into 37 categories of 
value, such as, the object is special because it is a memento, or because it is part 
of a collection, or because it belonged to a specific relative, or because it re- 
flects a specific achievement, etc. The 41 types of cultural items were then 
compared to one another in terms of the types of valuations associated with 
each. The same general method was used by Mohr & Guerra-Pearson (1998) to 
assess the discursive similarity of community social welfare organizations in 
the article discussed earlier. In this case organizations were compared to one 
another on the basis of the similarity of the claims that they made about the 
types of solution technologies that were seen to be appropriate for classes of 
persons and types of problems. 
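
A comparison of items by shared attributes might be sketched as follows. The Jaccard coefficient used here is an assumption chosen for simplicity; the studies cited above do not necessarily use this measure, and the object types and value categories are hypothetical stand-ins for coded interview data.

```python
def jaccard(a, b):
    """Share of attributes held in common by two attribute sets."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0                      # two empty profiles are identical
    return len(a & b) / len(a | b)


# Hypothetical coding: object types and the value categories attached to them.
meanings = {
    "photograph": {"memento", "relative"},
    "trophy": {"achievement", "memento"},
    "stamp album": {"collection"},
}
types = sorted(meanings)
similarity = {
    (x, y): jaccard(meanings[x], meanings[y]) for x in types for y in types
}
```

Here the photograph and the trophy share one of three distinct categories, while the stamp album shares none with either.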


Relational Similarity 


Similarity among items within a cultural system can also be assessed by look- 
ing for the presence or absence of various types of social relationships that link 
cultural objects together. Thomas Schweizer (1993) also studied household
possessions (in an urban neighborhood in French Polynesia, a group of
hunter-gatherers in Zaire, and a peasant community in Java). In Schweizer's study,
however, categories of possessions were seen to be similar or different from 
one another depending upon which individuals in the community possessed 
the same sets of items. So, for example, on the basis of who owned what in 
Papeete, a refrigerator was more like a radio and a bicycle was more like a 
kerosene stove. Thus, Schweizer interprets the status meaning of various mate- 
rial possessions by understanding what members of the community possess 
what types of goods. Also important are measures of similarity based on rela- 
tionships that directly connect the cultural items (or elements) to one another. 
Boorman & Levitt (1983) used this method to study how meanings were or- 
ganized in federal bankruptcy law. They coded the internal citation references 
of the 55 subsections of Article 9 of the Uniform Commercial Code. Thus, 
rather than asking individuals to sort cultural items that went together, Boor- 
man & Levitt asked which subsections of legal code were “substitutable” in 
the sense that they were similarly referenced by other subsections of the legal 
code. This is the same type of measure that was used by Tilly (1997) in his
parliamentarization article to measure similarities among categories of collective
actors. Those groups that had similar profiles of relationships to all other 
groups were linked together in the blockmodel analysis as being structurally 
equivalent. 
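
The idea of comparing profiles of relationships can be illustrated with a short sketch. It is not the blockmodel procedure used in any of the studies above; it shows only the underlying comparison, under the assumption that ties are stored as a matrix of claim counts. The counts and the function name equivalence_distance are hypothetical.

```python
from math import sqrt


def equivalence_distance(ties, i, j):
    """Euclidean distance between the tie profiles of actors i and j.

    Ties sent (rows) and ties received (columns) are both compared,
    looking only at third parties k, so the i-j tie itself is ignored.
    """
    total = 0.0
    for k in range(len(ties)):
        if k in (i, j):
            continue
        total += (ties[i][k] - ties[j][k]) ** 2   # claims made on k
        total += (ties[k][i] - ties[k][j]) ** 2   # claims received from k
    return sqrt(total)


# Hypothetical counts: ties[a][b] = claims made by group a on group b.
groups = ["workers", "farmers", "masters", "local officials"]
ties = [
    [0, 0, 5, 3],   # workers
    [0, 0, 5, 3],   # farmers
    [1, 1, 0, 0],   # masters
    [2, 2, 0, 0],   # local officials
]
d_workers_farmers = equivalence_distance(ties, 0, 1)
d_workers_masters = equivalence_distance(ties, 0, 2)
```

A distance of zero, as between workers and farmers in this made-up matrix, marks two groups as structurally equivalent; larger distances mark increasingly different relational positions.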


Similarity of Structural Function 


In some situations the measurement of similarities and differences has to
incorporate more complex information. To assess the structural meaning of cultural
items embedded within sequences and narratives, it is necessary to preserve
more than pairwise information on similarities. It is also necessary to
know how items are located within the broader context of the narrative or se- 
quential ordering of events (Abbott 1983, 1990). For example, most contem- 
porary approaches to content analysis require that words be compared on the 
basis of their syntactic or narrative function within the text. How one should 
gather this data is very much under debate (see Roberts 1989, Franzosi 1989, 
1990, Carley 1993, Carley & Kaufer 1993). In general, however, most strate- 
gies call for the preservation of two types of information—the semantic func- 
tion of a word within a field of meaning (e.g. within a sentence) and the spe- 
cific relationships that occur within that field to other (functionally defined) 
words. For example, Roberto Franzosi proposes that similarities of words 
within a text be recorded in such a way that their functions within a localized 
semantic grammar are preserved. This is an important extension (and formali- 
zation) of Tilly's methods. In his study of contentious gatherings in Italy be- 
tween 1919 and 1922, Franzosi (1997) coded newspaper stories so that various 
categories of agents (labor unionists, owners, brownshirts, police, and various
others) are compared to one another according to the extent to which they
fulfilled the same institutional function (instigators of violence, recipients of
violence, and so on). To get at this kind of information it is necessary to record
who does what to whom, and in what sequence. 
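
One way to picture this kind of record is as a list of subject-action-object clauses with their order preserved. The sketch below is a loose illustration and not Franzosi's coding scheme: the clauses, the action labels, and the helper names role_profiles and shared_functions are all hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical coded clauses: (sequence number, subject, action, object).
clauses = [
    (1, "labor unionists", "strike against", "owners"),
    (2, "police", "attack", "labor unionists"),
    (3, "brownshirts", "attack", "labor unionists"),
    (4, "labor unionists", "petition", "owners"),
]


def role_profiles(clauses):
    """Count how often each agent fills each (role, action) slot."""
    profiles = defaultdict(Counter)
    for _seq, subject, action, obj in clauses:
        profiles[subject][("subject", action)] += 1
        profiles[obj][("object", action)] += 1
    return profiles


def shared_functions(profiles, a, b):
    """Number of role-action slots that two agents fill in common."""
    return sum((profiles[a] & profiles[b]).values())


profiles = role_profiles(clauses)
police_vs_brownshirts = shared_functions(profiles, "police", "brownshirts")
police_vs_owners = shared_functions(profiles, "police", "owners")
```

The sequence numbers are not used in this comparison, but keeping them in the record leaves the ordering of events available for sequence-based methods.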


FINDING STRUCTURE 


Having collected information regarding the relations of similarity (or differ- 
ence) among a set of items within a cultural or institutional system, the next 
task is to find structure-preserving simplifications that may allow the com- 
plexity of the system to be more easily understood. Ideally, one hopes to iden- 
tify some deeper, simpler, structural logic—that is, a principle or set of princi- 
ples that account for the arrangement of parts within the cultural system (Ort- 
ner 1994). The relations between cultural items are the key to such an investi- 
gation. The analytical task is to discover how these relations are related to one 
another. Structuralist methods are geared toward the identification of
transformations that allow the relations among the relations to be reduced to more
easily understandable and/or visible patterns. I discuss four general classes of
methods for accomplishing this: (a) multidimensional scaling and clustering, 
(b) network analysis, (c) Boolean algebra, and (d) sequence analysis. Each of 
these approaches makes use of the relational information in different ways in 
order to highlight various qualities of the meaning structure. 


Multidimensional Scaling and Clustering 


Multidimensional scaling analysis (MDS) is one of the oldest and most widely 
used methods for mapping out the relational system of differences in the meas- 
urement of meanings. Simply stated, an MDS analysis reads in a square matrix 
of similarities or differences (actually half of a square matrix is usually used 
because the input data are often symmetrical) and produces a transformation of 
the data that seeks to locate all of the objects in a common (two- or more di- 
mensional) space in such a way that the similarities in the input matrix are 
transformed into Euclidean distances. An MDS space would thus represent a 
series of objects in such a way that (generally speaking) if two items are similar 
to one another (in the input matrix) then they are located near one another in 
the space. If they are dissimilar, they are located far apart. When analyzed in 
two (or three) dimensions, the items can then be easily plotted in such a way as 
to visually convey the relational structure in which they are embedded. 
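
The transformation from a matrix of differences to coordinates can be sketched in plain Python. The sketch uses classical (Torgerson) scaling with a simple power iteration, which is one variant among several and not necessarily the procedure used in the studies reviewed here. It assumes the input holds dissimilarities (similarities would first have to be converted), and the four test points are hypothetical.

```python
import random
from math import sqrt


def classical_mds(dist, dims=2, iters=500, seed=0):
    """Classical (Torgerson) scaling of a symmetric dissimilarity matrix."""
    n = len(dist)
    sq = [[dist[i][j] ** 2 for j in range(n)] for i in range(n)]
    row = [sum(r) / n for r in sq]
    grand = sum(row) / n
    # Double centering turns squared distances into inner products.
    b = [[-0.5 * (sq[i][j] - row[i] - row[j] + grand) for j in range(n)]
         for i in range(n)]
    rng = random.Random(seed)
    coords = [[0.0] * dims for _ in range(n)]
    for d in range(dims):
        v = [rng.random() - 0.5 for _ in range(n)]   # arbitrary start vector
        for _ in range(iters):                       # power iteration
            w = [sum(b[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = sqrt(sum(x * x for x in w))
            if norm < 1e-12:
                break
            v = [x / norm for x in w]
        bv = [sum(b[i][j] * v[j] for j in range(n)) for i in range(n)]
        lam = sum(v[i] * bv[i] for i in range(n))    # Rayleigh quotient
        if lam <= 1e-12:
            break                                    # no further positive dimension
        for i in range(n):
            coords[i][d] = v[i] * sqrt(lam)
        # Deflation removes the dimension that was just extracted.
        b = [[b[i][j] - lam * v[i] * v[j] for j in range(n)] for i in range(n)]
    return coords


def euclid(p, q):
    return sqrt(sum((a - c) ** 2 for a, c in zip(p, q)))


# Hypothetical check: distances among the corners of a 4 x 2 rectangle.
points = [(0.0, 0.0), (4.0, 0.0), (4.0, 2.0), (0.0, 2.0)]
dist = [[euclid(p, q) for q in points] for p in points]
coords = classical_mds(dist)
max_error = max(abs(euclid(coords[i], coords[j]) - dist[i][j])
                for i in range(4) for j in range(4))
```

The recovered coordinates may be rotated or reflected relative to the original points, since only the distances between items are preserved; this is why MDS plots are interpreted in terms of relative position and not absolute orientation.
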

Clustering methods have also been used for studying cultural meanings.
There are a great variety of clustering methods, but all seek to group items to- 
gether according to some algorithmic principle for deciding which are most 
alike given the input matrix of item by item similarities. One of the primary 
differences between clustering methods and MDS is that the latter locates
items in a way that takes into account the totality of all relational similarities in
each iteration, whereas clustering methods tend to connect items to clusters
that are deemed (by some criteria) to be most similar in a localized (or pair- 
wise) sense. The two methods are often used together (as when the coordinates 
of an MDS space are submitted to a cluster analysis in order to more clearly 
designate subregions within the space). The Sage books on these topics are 
both excellent and still quite useful (Kruskal & Wish 1978, Aldenderfer & 
Blashfield 1984). 
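
The localized, pairwise logic of clustering can be seen in a minimal agglomerative sketch. Average linkage is used here as one algorithmic principle among the many available; the similarity values and the function name average_linkage are hypothetical.

```python
def average_linkage(sim, labels, n_clusters):
    """Merge the two most similar clusters until n_clusters remain."""
    clusters = [[i] for i in range(len(labels))]

    def between(a, b):
        # Mean similarity over all cross-cluster pairs of items.
        return sum(sim[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > n_clusters:
        best = None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                score = between(clusters[x], clusters[y])
                if best is None or score > best[0]:
                    best = (score, x, y)
        _, x, y = best
        clusters[x] = clusters[x] + clusters[y]   # y > x, so x stays valid
        del clusters[y]
    return [sorted(labels[i] for i in c) for c in clusters]


# Hypothetical similarities among four personality traits.
labels = ["brave", "clever", "lazy", "stingy"]
sim = [
    [1.00, 0.70, 0.05, 0.10],
    [0.70, 1.00, 0.10, 0.05],
    [0.05, 0.10, 1.00, 0.60],
    [0.10, 0.05, 0.60, 1.00],
]
groups = average_linkage(sim, labels, 2)
```

Each merge is decided only by the current best pair of clusters, which is the sense in which clustering works locally, whereas a scaling solution positions every item with respect to all others at once.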

These techniques have been widely used by cognitive anthropologists for 
studying meaning systems. As D'Andrade (1995) explains, the 1970s were a 
time when basic theories about culture as knowledge systems were connected 
to formal methods of analysis. It was during these years that many of the clas- 
sic applications of these techniques to understanding the structure of cultural 
meaning systems were pioneered. For example, Burton & Kirk (1977, 1979) 
used MDS to model gender assumptions in Maasai culture. They asked their 
respondents to think about the personality characteristics of various social 
identities differentiated by age and gender (small boy, older boy, warrior, adult 
male, small girl, older girl, and young adult woman). Using triad tests, they 
collected information on similarities among 13 personality traits (disobedient, 
brave, frank, hardworking, socially competent, playful, lazy, respectful, 
fickle, suspicious, clever, skittish, and stingy). 

Their findings showed that small boys were expected to be playful, success- 
ful, brave, respectful, clever, and disobedient, but as they aged, they were ex- 
pected to also become hardworking and to no longer be disobedient. Once they 
became warriors, however, expectations shifted so that they were expected to 
be realistic and, once again, disobedient, while stinginess was frowned upon. 
Burton & Kirk relate these shifts in cultural beliefs to changes in males' roles 
within the community. For example, as warriors, young men are often on raid- 
ing parties away from home where they are expected to use their own initiative 
and, when appropriate, to disobey the commands that had been given them by 
the elders in the village. They are also expected to work together as a group, re- 
lying on one another for survival, making stinginess a significant liability. The 
analyses show a very different pattern for women. Young adult women are ex- 
pected to be lazy and stingy and they are expected not to be successful, clever, 
or brave. Kirk & Burton note that the shift in normative expectations for men 
and women that occurs after circumcision is striking. For men, the change 
marks a time of ascendant authority and autonomy. For women it marks a clear 
acceleration of negative stereotyping and formal disempowerment. They con- 
clude that a primary function of circumcision rituals in Maasai culture is to 
dramatize and institutionalize differences in power between men and women. 
Numerous other examples from this period of anthropological research could 
be pointed to. Another classic example is the study by Roy D'Andrade and his
colleagues in which they used MDS to model the structure of meanings regarding
disease categories among English and Spanish speaking students (D'Andrade
et al 1972).

Sociologists have traditionally used MDS techniques to study social net- 
works or social organizational structures, such as in Laumann & Knoke’s 
(1987) study of the structure of policy domains. More recently, sociologists 
have begun employing MDS analyses to map out cultural meaning structures 
in a fashion that is more reminiscent of the work of anthropologists. For exam- 
ple, Ennis (1992) and Cappell & Guterbock (1992) used MDS techniques to 
model the structure of American sociological specialties. Ventresca & Yin 
(1997) used MDS to analyze the meaning of social categories in a broad his- 
torical sample of national census surveys. Mohr (1998) uses MDS techniques 
to analyze gendered assumptions in the structure of poverty classifications 
used by Progressive Era social reformers. 


Network Analysis 


The one subfield within American sociology that has always been thoroughly 
structuralist (in the relational sense) is social network analysis. Typically in 
this research tradition, individuals are the nodes, and various types of social re- 
lationships that link individuals to one another (friendship, kinship, social ex- 
changes of various sorts) constitute the "ties" out of which a pattern of social 
organization is constructed. Over the years, network scholars have developed 
an enormous array of conceptual and methodological tools for thinking about 
the structural properties of social networks. The methods can be divided into 
two general types—those emphasizing connectivity and those focusing on 
structural equivalence (Burt 1978, Wasserman & Faust 1994). The former are 
largely concerned with how the relationships between individuals in a social 
network are mediated by the structure of ties that directly connects them. Con- 
cepts like network centrality, between-ness, and adjacency are especially im- 
portant here. The latter approach focuses on ways in which individuals stand in 
structurally equivalent positions because they share common patterns of rela- 
tionships to all others in the network. The key concept here is the idea of a 
structural role (White et al 1976). While most of the empirical work has been 
directed to understanding the properties of social networks, these methods can 
also be extremely useful in the study of cultural meaning structures. 
Kathleen Carley has been one of the pioneers in applying network method- 
ologies to the study of cultural phenomena, especially in her work on textual 
analysis (Carley 1986, 1993, Carley & Kaufer 1993). Carley uses network 
analysis techniques to map out the structure of meanings within narratives. 
Like Cerulo, Carley then proceeds to use these structural representations of 
meanings to compare cultural phenomena. For example, Carley (1994) uses 
these techniques to map the structure of meaning in 30 science-fiction novels 
written during different periods of history. She codes information on ways in
which the authors describe the features, actions, and other characters’ percep- 
tions of robots. By studying the relationships between the features that were at- 
tributed to robots by the novels' authors as a network of concepts, Carley is 
able to represent changes in the meaning structure of how robots were por- 
trayed over time. Prior to 1950 they were described as dangerous nonmetallic 
humanoids composed of batteries, electron tubes, and human parts that in- 
spired fear, anger, hatred, and pity in others. By the 1980s they were instead 
being described as clever, loyal, curious, and capable of sarcasm, being embar- 
rassed, making love, programming computers, and dreaming. 

A very different example can be found in Hammond's (1972) analysis of 
the ruins of Mayan architecture. He employed network models of connectivity 
(between plazas) as a way of studying how public space was used. On the basis 
of these models, Hammond was able to deduce which plaza served as the cen- 
tral marketplace and to postulate that the Mayan ballcourt game was, initially 
at least, a sport reserved for an elite audience. 
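
The connectivity logic behind an analysis of this kind can be sketched in a few
lines of Python. The plaza names and ties below are invented for illustration
and are not Hammond's data; the sketch assumes a connected, undirected graph
and uses closeness centrality as one plausible measure of which node is most
accessible from all the others.

```python
from collections import deque

# Hypothetical layout: which plazas open directly onto which others.
# Names and ties are invented; they are not taken from Hammond (1972).
PLAZA_TIES = {
    "A": {"C"},
    "B": {"C"},
    "C": {"A", "B", "D"},
    "D": {"C", "E"},
    "E": {"D"},
}


def distances_from(graph, start):
    """Breadth-first search: number of steps from start to each reachable node."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for neighbor in graph[node]:
            if neighbor not in dist:
                dist[neighbor] = dist[node] + 1
                queue.append(neighbor)
    return dist


def closeness(graph):
    """Closeness centrality: (n - 1) divided by the summed distance to all others.

    Assumes the graph is connected, so every other node is reachable.
    """
    scores = {}
    for node in graph:
        dist = distances_from(graph, node)
        total = sum(d for other, d in dist.items() if other != node)
        scores[node] = (len(graph) - 1) / total
    return scores


scores = closeness(PLAZA_TIES)
most_central = max(scores, key=scores.get)
```

With these assumed ties, plaza C has the highest closeness score (0.8), since
every other plaza can be reached from it in at most two steps.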

Structural equivalence approaches to network analysis have also been used to
understand cultural phenomena. Mohr (1994) uses this method to map the struc- 
ture of moral discourse within which poverty relief agencies differentiated 
among various classes of gendered status identities (mothers, soldiers, widows, 
working men, the blind, and various others) during the Progressive Era. Here, it 
is the status identities themselves that are the items within the cultural system.
Relations among identities were measured by assessing the profile of relief 
practices that were applied to each (58 different practices were investigated, 
e.g. some classes of the poor were given money by the state, others were placed
in the poorhouse, some were given counseling by the church, others were 
given job training by private agencies, and so on). A block-model analysis 
showed that it was possible to identify different role positions within the sys- 
tem of moral discourse that closely corresponded to Theda Skocpol's (1992) 
arguments about American welfare politics during these years. The discourse 
role analysis was also shown to be an effective predictor of the likelihood that 
any given status category would be described with morally coded terms. 
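
The idea of comparing identities by their profiles of practices can be
illustrated with a minimal Python sketch. The profiles below are hypothetical
(the category names echo the text, but the 0/1 entries are invented), and
Euclidean distance between profiles is used here as one common measure of
structural equivalence. A block-model analysis would go further and partition
all of the identities into positions.

```python
from itertools import combinations
from math import sqrt

# Columns (assumed): cash relief, counseling, job training, poorhouse.
# 1 = the practice was applied to that identity; all values are invented.
PROFILES = {
    "widows":      [1, 1, 0, 0],
    "mothers":     [1, 1, 0, 0],
    "soldiers":    [1, 0, 0, 0],
    "working men": [0, 0, 1, 0],
    "the blind":   [1, 0, 0, 1],
}


def profile_distance(a, b):
    """Euclidean distance between two profiles; 0 means identical treatment."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Distance for every pair of identities, in the order the identities are listed.
distances = {
    (p, q): profile_distance(PROFILES[p], PROFILES[q])
    for p, q in combinations(PROFILES, 2)
}

# Pairs at distance zero are structurally equivalent in the strict sense.
equivalent_pairs = [pair for pair, d in distances.items() if d == 0]
```

In this invented example only widows and mothers are strictly equivalent;
soldiers lie one practice away from both.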

Other examples can also be pointed to. Peter Bearman (1993) uses struc- 
tural equivalence to show how changes in the social organization of elites in 
16th-century England led to the emergence of abstract religious and constitu- 
tionalist rhetorics. Anheier & Gerhards (1991a) use structural equivalence 
measures of the role structure of German writers to explain the development of
cultural myths about writers: the myths of the writer as a "notorious loner," as
"a poor poet," as "the misunderstood genius," and as a member of a "romantic
literary circle" are all shown to correspond to the social organization of German
writers. Finally, Patrick Doreian (1987) uses structural equivalence measures 
to reanalyze Hammond's data on Mayan architecture. 


Boolean Algebra


Boolean algebra is an increasingly popular method for formally analyzing 
qualitative data. Charles Ragin (1987) has been instrumental in bringing these 
methods to the attention of sociologists in recent years. This approach (also 
known as qualitative comparative analysis, or QCA) begins with a set number 
of cases that vary in terms of the presence or absence of some outcome (or 
product). Cases are grouped according to their profile of features. Boolean al- 
gebra is then applied to identify logically irreducible and nonredundant combi- 
nations of features that are associated with certain outcomes. As in network 
methods, the method presumes a finite number of cases within a system (rather 
than a sample from a population), and the focus is on qualities (presence and 
absence rather than measured quantities) of the features. Unlike the methods 
discussed above, the usual goal of a Boolean analysis is to ascertain which fea- 
tures of a set of cases have a causal relationship with some other feature (or 
outcome) that is being explained. In this sense, QCA is similar to logistic
regression analysis, although as Ragin explains (1995), there are a variety of
reasons why one might prefer using QCA, including the fact that it can be applied
with many fewer cases. 
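
The reduction step can be sketched in Python under simplifying assumptions.
The six cases and three conditions below are hypothetical, contradictory and
unobserved configurations are ignored, and the routine only finds the
irreducible terms (prime implicants) by pairwise merging. It is not a
substitute for dedicated QCA software.

```python
from itertools import combinations

# Hypothetical cases: presence (1) or absence (0) of conditions A, B, C,
# followed by whether the outcome occurred. All values are invented.
CASES = [
    ((1, 1, 0), 1),
    ((1, 1, 1), 1),
    ((1, 0, 1), 1),
    ((0, 1, 0), 0),
    ((0, 0, 1), 0),
    ((1, 0, 0), 0),
]


def merge(p, q):
    """Combine two terms that differ in exactly one condition.

    The differing condition is replaced by '-', meaning 'irrelevant here'.
    Returns None when the terms cannot be combined.
    """
    diff = [i for i, (a, b) in enumerate(zip(p, q)) if a != b]
    if len(diff) == 1 and "-" not in (p[diff[0]], q[diff[0]]):
        return p[:diff[0]] + ("-",) + p[diff[0] + 1:]
    return None


def prime_implicants(terms):
    """Merge terms repeatedly; keep those that cannot be reduced any further."""
    terms = set(terms)
    primes = set()
    while terms:
        merged, used = set(), set()
        for p, q in combinations(terms, 2):
            m = merge(p, q)
            if m is not None:
                merged.add(m)
                used.update((p, q))
        primes |= terms - used
        terms = merged
    return primes


positive = [profile for profile, outcome in CASES if outcome == 1]
solution = prime_implicants(positive)
```

For these invented cases, the positive configurations reduce to two terms,
(1, 1, '-') and (1, '-', 1): the outcome occurs when A is combined with
either B or C.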

Because of the emphasis on hypothesis-testing, QCA has generally not 
been applied to the analysis of meaning structures in the same manner as MDS 
or network techniques. There are some exceptions. Degenne & Lebeaux (1996) 
use Boolean techniques to analyze the structure of belief systems. Using French 
survey data they show a causal ordering of religious beliefs and practices (re- 
spondents who pray regularly also go to church regularly or they "believe in 
paradise, purgatory, hell," both of which imply that their children will be given 
a religious education, and so on). More generally, however, applications of 
Boolean analysis tend to be very much in the spirit of the work described in this 
review, and to resonate especially well with those works in which meanings 
are seen to be a constituent element of institutional patterns. For example, John 
Foran (1997) uses QCA to analyze the role that five factors (dependent devel- 
opment, economic crisis, repressive/personalist state, world system opportu- 
nity, and political cultures of resistance) have played in the success, failure, or 
quiescence of a dozen recent revolutionary situations. Foran demonstrates that 
all factors must be present for success, and that the last three of these factors, 
including a particular cultural orientation, are especially salient. 


Sequence Analysis 


Sequence analysis is another qualitatively oriented methodology that is in- 
creasingly being employed by sociologists. These methods are used for find- 
ing reduced form patterns in the sequencing of events through time. Andrew 
Abbott (1988, 1992) has been instrumental in bringing these methods into 
mainstream sociology. This is another methodological approach in which
elements within a system are compared on the basis of their similarities to one
another. These kinds of methods can also be used for identifying certain types of
cultural meaning structures. 

Abbott & Forrest (1986) used these methods to understand cultural diffu- 
sion among communities of English Morris Dancers. By studying the sequenc- 
ing of dance steps, Abbott & Forrest were able to identify common dance tradi- 
tions that they used to analyze the diffusion of cultural forms among nine- 
teenth-century rural English communities. Other methods have also been sug- 
gested for analyzing narrative structures. Peter Abell’s (1987) work empha- 
sizes the comparison of sequences in multiple narratives. David Heise (1988, 
1989) has developed Event Structure Analysis (ESA) and a computer pro- 
gram, ETHNO, for its implementation which has been applied by Larry Griffin 
(1993) to studying the unfolding of historical event logics. Here again, a struc- 
tural representation is created of a particular institutional (meaning) system 
that can then be used as a formal basis for comparative analysis. 
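
Comparison of sequences of this kind is often based on some form of edit
distance. The Python sketch below uses unit costs for insertions, deletions,
and substitutions (the Levenshtein distance), which is a simplifying
assumption; optimal matching applications typically allow the costs to vary.
The step sequences are invented.

```python
def edit_distance(seq_a, seq_b):
    """Fewest insertions, deletions, and substitutions turning seq_a into seq_b."""
    previous = list(range(len(seq_b) + 1))
    for i, item_a in enumerate(seq_a, start=1):
        current = [i]
        for j, item_b in enumerate(seq_b, start=1):
            cost = 0 if item_a == item_b else 1
            current.append(min(
                previous[j] + 1,         # delete item_a
                current[j - 1] + 1,      # insert item_b
                previous[j - 1] + cost,  # keep or substitute
            ))
        previous = current
    return previous[-1]


# Hypothetical step sequences for three village dances (labels are invented).
DANCES = {
    "village_1": ["step", "hop", "turn", "clash", "step"],
    "village_2": ["step", "hop", "turn", "step"],
    "village_3": ["clash", "clash", "hop", "bow"],
}

d12 = edit_distance(DANCES["village_1"], DANCES["village_2"])
d13 = edit_distance(DANCES["village_1"], DANCES["village_3"])
```

A matrix of such pairwise distances can then be clustered or scaled to
suggest which sequences belong to a common tradition.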


MEASURING THE DUALITY OF SOCIAL AND 
CULTURAL STRUCTURES 


I started this review by noting that theorists have emphasized the duality of
social and cultural structures and I suggested that one important reason for
measuring meaning is that doing so allows us to take this theoretical mandate
seriously. The three examples of institutional analysis with which I began were
selected to illustrate how the measurement of meaning can be used to analyze the
cultural construction of social structures. It should be apparent, however, that 
we also need to attend to the ways in which social structures produce cultural 
meaning systems. Indeed, my earlier discussion of practice theory would
suggest that such an approach is critical for cultural analysis. My focus until now
has been to suggest some ways in which meaning structures can be measured. 
In this section, I discuss several methods that can be employed to focus atten- 
tion on the duality that inheres between cultural and social structures. These 
are the class of methods that are described as two-mode data analytic strategies 
because they simultaneously order both columns and rows of a data matrix. 
Several methods are available. 


Correspondence Analysis 


Probably the most familiar example of this kind of approach is correspondence 
analysis (Weller & Romney 1990). Like MDS, correspondence analysis can be 
used to represent a set of cultural items in a dimensional space, thereby allow- 
ing their underlying structural patterns to be represented visually. At the same 
time, correspondence analysis enables the relational foundations of the
cultural system to be treated as a set of (social-structural) elements that are
themselves ordered in terms of their relations to the cultural domain. Thus, this
methodology allows both the social and the cultural dimensions to be plotted 
within the same measurement space. 
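
One way to see how rows and columns come to share a single space is the
reciprocal averaging procedure, which yields the first axis of a
correspondence analysis. The Python sketch below assumes a small hypothetical
table of counts with no empty rows or columns and with some association
between rows and columns; the function name and the data are illustrative
only.

```python
def reciprocal_averaging(table, iterations=200):
    """First correspondence-analysis axis of a table of nonnegative counts."""
    n_rows, n_cols = len(table), len(table[0])
    row_tot = [sum(row) for row in table]
    col_tot = [sum(table[i][j] for i in range(n_rows)) for j in range(n_cols)]
    grand = sum(row_tot)

    col_scores = [float(j) for j in range(n_cols)]  # arbitrary starting values
    for _ in range(iterations):
        # Center and rescale the column scores, weighting by column totals.
        mean = sum(c * y for c, y in zip(col_tot, col_scores)) / grand
        col_scores = [y - mean for y in col_scores]
        var = sum(c * y * y for c, y in zip(col_tot, col_scores)) / grand
        col_scores = [y / var ** 0.5 for y in col_scores]
        # Each row score is the weighted average of the column scores ...
        row_scores = [
            sum(table[i][j] * col_scores[j] for j in range(n_cols)) / row_tot[i]
            for i in range(n_rows)
        ]
        # ... and each column score is the weighted average of the row scores.
        col_scores = [
            sum(table[i][j] * row_scores[i] for i in range(n_rows)) / col_tot[j]
            for j in range(n_cols)
        ]
    return row_scores, col_scores


# Hypothetical counts: rows are class fractions, columns are cultural tastes.
TASTES = [
    [10, 8, 1, 0],
    [9, 7, 2, 1],
    [1, 2, 8, 9],
    [0, 1, 9, 10],
]
rows, cols = reciprocal_averaging(TASTES)
```

In the example, the first two rows and the first two columns receive scores of
one sign and the remaining rows and columns scores of the opposite sign, so
social positions and cultural items that go together sit on the same side of
the axis.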

As an early proponent of practice theory, Pierre Bourdieu was naturally 
drawn toward this methodology and he has used it extensively in his research. 
In Distinction (1984), Bourdieu uses data on the cultural tastes of different 
class fractions to identify the class-based logic of cultural goods. Thus,
Bourdieu shows that taste in music, film, painting, recreational activities, and other
cultural domains is organized according to the structure of social classes in
France, and using correspondence analysis, he is able to simultaneously plot 
the various social locations (e.g. the determinants of class structure) according 
to their relative location within the space of cultural distinctions. These analy- 
ses have been central to Bourdieu's theoretical goal of measuring the relation- 
ship between social and cultural domains, and he used them to demonstrate 
that, for example, class locations are ordered according to two salient dimen- 
sions—the total volume and relative composition of cultural and economic 
capital. 

Because of its ability to represent the duality of mutually constituted struc- 
tures, correspondence analysis is beginning to be used more frequently by 
American sociologists as well (although other approaches have also begun to 
be developed, such as the proposal of Borgatti & Everett 1997 for using MDS 
to analyze two-mode data). Ann Mische (1998), for example, uses correspon- 
dence analysis to analyze how youth movements in Brazil were structured by 
the types of political beliefs which they held at the same time that the political 
discourse itself was structured by the groups who were constituting it. 


Lattice Analysis 


Galois lattices are another approach that is explicitly oriented toward the
representation of the dual ordering of rows and columns. Whereas
correspondence analysis can be viewed as a two-mode extension of MDS techniques,
lattice analysis provides a dual mapping of rows and columns based on Boo- 
lean set theory. 

Mohr & Duquenne (1997) use lattices to analyze the changing institutional 
logics of poverty relief during the Progressive Era. Here the duality between 
cultural meanings and social practices is operationalized explicitly. Reform- 
ers’ classifications of the poor (distressed, destitute, fallen, deserving, home- 
less, indigent, misfortunate, needy, poor, stranger, and worthy) are shown to 
be embedded within a hierarchically ordered meaning system by the structur- 
ing of organizational practices (giving advice, giving food, giving money, pay- 
ing a person to chop wood, placing a relief applicant in an asylum, and so on). 


The use of lattices demonstrates how it was that the cultural and practical lo- 
gics were mutually constitutive. In other words, the ordering of the poverty 
classifications is shown to be determined by the ordering of the relief activities 
and vice versa. By mapping the changes in this dualistic structure through 
time, Mohr & Duquenne demonstrate how the institutional logic of the
nineteenth-century poorhouse system was replaced by a far more progressive
system founded in the rhetoric and practice of social work professionals.

The paper by Thomas Schweizer (1993) discussed earlier provides another 
example. Recall that Schweizer analyzes the relationship that inheres between 
the status logic of material possessions and the social ranking of individuals. 
He shows that the meaning of possessions can be interpreted by understanding 
what members of the community possess what types of goods. Simultaneously 
(and dually), Schweizer assesses the ranking of individual community mem- 
bers in the social order by observing what material goods they possess. The 
structural duality of these two orders (the individual members of the commu- 
nity and the material possessions that they hold) consists in the fact that the or- 
dering of one is simultaneously dependent upon the ordering of the other. Vin- 
cent Duquenne (1995) provides an additional example of the use of lattices to 
analyze status orders and material wealth. 
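
As a rough illustration, the Python sketch below enumerates the nodes of a
Galois lattice (often called formal concepts) for a tiny hypothetical table.
The classifications and practices echo those mentioned in this section, but
the ties among them are invented, and the brute-force enumeration is only
practical for very small tables.

```python
from itertools import combinations

# Hypothetical two-mode data: which relief practices were applied to which
# classifications of the poor. The ties are invented for illustration.
APPLIED = {
    "worthy":    {"advice", "money"},
    "deserving": {"advice", "money", "food"},
    "destitute": {"food", "asylum"},
    "homeless":  {"asylum"},
}


def shared_practices(categories):
    """Practices applied to every category in the set (all of them if empty)."""
    all_practices = set().union(*APPLIED.values())
    return frozenset(all_practices.intersection(*(APPLIED[c] for c in categories)))


def matching_categories(practices):
    """Categories that received every practice in the set."""
    return frozenset(c for c, applied in APPLIED.items() if practices <= applied)


def concepts():
    """All (categories, practices) pairs in which each side determines the other."""
    found = set()
    names = list(APPLIED)
    for size in range(len(names) + 1):
        for subset in combinations(names, size):
            intent = shared_practices(subset)
            found.add((matching_categories(intent), intent))
    return found


lattice = concepts()
```

With these invented ties the lattice has seven nodes. Each node joins a set of
categories to exactly the set of practices they share, and vice versa, so
enlarging one side can only shrink the other; this is the dual ordering that
the lattice diagram displays.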


Hierarchical Classification Models 


A final example of two-mode analytic techniques is the hierarchical classifica- 
tion model developed by Paul de Boeck and Seymour Rosenberg, which is im- 
plemented in the software program HICLAS (Rosenberg et al 1995). This is an 
iterative algorithm that accomplishes the same goal as a lattice analysis by
employing set theoretical principles to cluster items in the rows (objects) and 
the columns (attributes) of a two-way two-mode binary matrix such that 
equivalent items are classed together, the classes of objects are hierarchically 
ordered, the classes of attributes are hierarchically ordered, and the two
hierarchical orders are related to one another.

This technique has been employed to measure several types of cultural 
meaning systems. Anheier & Gerhards (1991b) use this method to understand 
how famous writers (Ernest Hemingway, William Shakespeare, and others) 
were organized into a field of literary influence by comparing which contem- 
porary German writers cite them as having been inspirational in their own 
work. The results allow Anheier & Gerhards to describe the dual mapping of 
blocks of contemporary writers onto a clustering of literary influences in much 
the same way as Bourdieu’s analysis maps social classes and cultural tastes in 
the same measurement space. Rosenberg (1989) uses HICLAS to analyze 
meaning structures in Thomas Wolfe's autobiographical novel, Look Home- 
ward, Angel. In this case the set structure of various attributes of individuals is 
dually mapped onto the set structure of characters within the novel. Rosenberg 
shows with this method that members of Wolfe's family are organized in clear
subsets of attributes (his brother Fred is a subset of his sister Mabel, who is a 
subset of his mother). Rosenberg also demonstrates that Wolfe's description of 
his own identity at different ages is hierarchically nested, and that Wolfe's de- 
scription of his self shifts through life to being more and less like various mem- 
bers of his family. 
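
The nesting described here can be pictured with a small set-theoretic sketch
in Python. Only the ordering of Fred, Mabel, and the mother follows the text;
the trait labels and the two extra characters are placeholders, and the code
shows exact subset relations rather than the approximate model fitted by
HICLAS.

```python
# Attribute sets are invented; only the nesting Fred < Mabel < mother
# follows the description in the text.
ATTRIBUTES = {
    "mother":  {"trait_1", "trait_2", "trait_3"},
    "Mabel":   {"trait_1", "trait_2"},
    "Fred":    {"trait_1"},
    "other_1": {"trait_1"},   # hypothetical extra character
    "other_2": {"trait_4"},   # hypothetical extra character
}


def classes(profiles):
    """Objects with identical attribute sets fall into the same class."""
    grouped = {}
    for name, attrs in profiles.items():
        grouped.setdefault(frozenset(attrs), []).append(name)
    return grouped


def hierarchy(grouped):
    """Pairs (lower, higher): the lower class's attributes are a proper subset."""
    return [
        (sorted(grouped[a]), sorted(grouped[b]))
        for a in grouped
        for b in grouped
        if a < b
    ]


grouped = classes(ATTRIBUTES)
order = hierarchy(grouped)
```

Here Fred and the first placeholder character form one class, which sits below
Mabel, who in turn sits below the mother; the second placeholder character is
unrelated to the rest of the hierarchy.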


ADVANTAGES AND DISADVANTAGES OF 
FORMAL ANALYSIS 


Precisely because the measurement of meanings lies at the boundaries of the 
more scientistic and the more humanistic approaches to analyzing social phe- 
nomena, it is guaranteed to create controversy. Norman Denzin (1991), for ex- 
ample, complains that scientifically oriented studies of culture have a tendency 
to reify and reduce the intrinsic complexities of cultural meanings. There is no 
doubt that he is right about this. All measurement projects end up reifying
reality to some extent. Moreover, it is clear that there are other more hermeneutic
and post-structuralist approaches to the problem of interpretation that proceed 
along very different pathways and have a great deal to offer. It would be a mis- 
take to presume that the type of structuralist models of meaning that have been 
reviewed here are in any sense exhaustive or even that they should be ceded 
some sort of empirical primacy. 

Measuring meaning structures does have its benefits, however. As Robert 
Merton (1957) suggested in his classic essay regarding the advantages of em- 
pirical research, the use of formal methods brings a pressure for the clarification 
of one's concepts and may lead to the discovery of anomalous findings that 
call theoretical assumptions into question. Moreover, as Hage & Harary (1983, 
p. 9) argued in their attempt to persuade anthropologists of the advantages of 
graph theory, the shift toward quantitative relational methodologies also has a 
number of more specific advantages. While my focus in this review has been 
somewhat different, I think Hage & Harary's list is applicable here as well. In 
somewhat more generalized form, these advantages are the following: 


1. Structural models are iconic. They look like what they represent. Whether
one uses MDS techniques to generate a picture of the similarity relationships
between cultural items or one employs a lattice to map out the hierarchical
duality of cultural items and actions, these techniques allow us to see
the patterns of difference out of which meanings are constructed. Once we
see these patterns, we can begin to understand them.

2. Structural models provide us with a rich conceptual vocabulary for thinking
about meanings. Simplicity, complexity, centrality, duality, permutations,
and transformations are ideas that have a natural foundation in structural
models. As we try to grapple with understanding phenomena such as cultural
boundaries, cultural identities, cultural differentiation, and cultural
narrativity, I suspect that these kinds of structural concepts will be of much
use to us.


3. Structural models provide us with the tools for quantifying aspects of mean- 
ing structures. As I have tried to indicate here, we can quantify meanings 
without reducing them to some artificially linear metric. Instead we can 
build structural models that simply represent relationships of similarity and 
difference that are embedded within practices. Once we have found a way 
of measuring these kinds of phenomena, however, we (and others) are in a 
position to be able to replicate our interpretations and to subject our ideas to 
various kinds of formal tests to help assess the validity of our interpretation. 


CONCLUSION 


While I will be pleased if qualitatively oriented scholars are persuaded by the 
merits of these arguments and the utility of these techniques, my primary goal 
has been to convince sociologists who already use quantitative methods to em- 
brace the measurement of meaning and to incorporate these measures into a 
more balanced approach to social research that recognizes the duality of cul- 
tural and social structures. In this regard I fully agree with Harrison White 
when he argues that “interpretive approaches are central to achieving a next 
level of adequacy in social data" (1997:57—58). I also agree with Jepperson 
and Swidler's (1994) contention that culture is no more intrinsically difficult
to measure than other social phenomena and that the greatest impediments to 
the formal analysis of culture are conceptual rather than methodological. 

I have focused here on one very basic level of conceptualization. I have ar- 
gued that cultural meanings are built up out of structures of difference and that 
by attending to the patterns of relations that link items within a cultural system, 
we can use formal methodologies to measure and analyze meaning structures. I 
have also suggested that a critical component of any such endeavor is
determining the linkage between meanings and practices and that relations between
the former should be determined by their embeddedness within the latter, and 
vice versa. Beyond this, I have said very little here about what purposes these 
measurements should be put to or how cultural analysis should proceed. I have 
done this quite intentionally because I believe that these are general purpose 
methods that can be employed to accomplish a great variety of things. This 
should not be taken to indicate that a theoretical context for their use is
unnecessary or undesirable. On the contrary, to use these methods without a
theoretical goal in mind is unlikely to yield much that is of sociological interest.

In this review I have provided a sampling of the types of research projects 
that are beginning to appear in the sociological literature that seek to analyze 
the structure of institutional meanings. As I suggested earlier, there is as yet no 
coherent body of literature or collection of scholars identified with this project.
There is, however, tremendous enthusiasm and opportunity for research of this 
sort. Before closing, it seems appropriate to offer a small list of suggestions for 
scholars who have an interest in this style of research. In summary form, my 
advice is to first, get the text, second, find the use, and third, map the meaning. 


Get the Text 


Meanings are complex things. Any attempt to model them or subject them to 
formal analysis invariably involves a gross simplification of the cultural mate- 
rial. As a consequence, it behooves us to postpone this simplification process 
as long as is feasible. If we can wait until the third step—the stage at which we 
are consciously trying to reduce the data according to some structural princi- 
ples—we have much greater control over the way in which we simplify the 
cultural material. In practical terms, this means that we should always try to 
gather data as unobtrusively as possible. It is best to avoid imposing an arbi- 
trary coding scheme on the data if it can be avoided. Instead, we should try to 
find a way to get the entire "text" into a computer (or to come as close as possi- 
ble to this ideal). 

In this regard it is probably worth pointing out that we are just now entering 
what must surely be the golden age of textual analysis. What sets this moment 
in history apart is the incredible proliferation of on-line and on-disk textual
materials. Previously, scholars who were interested in doing some form of 
content analysis were compelled to spend huge amounts of time readying their 
texts for analysis. Now one can easily sit at one's desk and more or less instan- 
taneously summon up a fantastic array of cultural texts in electronic form. This 
includes software editions of many contemporary novels (currently being mar- 
keted for use in laptop computers) for about the same price as a hardbound 
book, newspaper and magazine articles (published on the Internet), annual
reports of corporations (available on CD-ROM), full transcripts of free-form
conversations and other types of social exchanges (occurring in virtual
"chatrooms" and user interest groups on the Internet), to name but a few.


Find the Use 


As I argued earlier, the most critical component of the meaning measurement 
process is the identification of the systems of relations that will be used to gen- 
erate the assessment of similarities and differences among cultural items. Be- 
cause there are invariably any number of ways in which these relations might 
be assessed, it is essential that a theoretically informed metric be applied at this 
stage of the analysis. The best rule of thumb in this situation is to locate and 
evaluate the relevant domain of practical activity in which the identified sys- 
tem of cultural meanings is embedded. Differences in practice produce (and 
are produced by) differences in meaning. Therefore the goal of an empirical 
analysis should be to assess how the various cultural elements are
differentially implicated in alternative forms of practice.


Map the Meaning 


Not all of the examples of cultural interpretation that I've described in this
review use structural models as a way of representing cultural meanings. Clearly
situations exist wherein this kind of formal approach simply isn't feasible or 
desirable. Nonetheless, part of what I've tried to suggest here is that the project 
of finding meaning structures is not really different from the project of
measuring meaning structures, or rather, that there is a difference in the degree of
formality rather than in the type of endeavor. But, having said that much, I do
think that we can profitably make greater use of formal methods and especially 
the sorts of two-mode modeling techniques that I discussed earlier. 

Finally, the techniques that I have referred to are readily available and usu- 
ally quite simple to use. Scaling and correspondence analysis programs are in- 
cluded in most major statistical packages. Boolean algebras and sequence 
analysis programs are easily obtainable and geared to run on most microcom- 
puters. Network analysis packages are likewise widely available and, nowa- 
days at least, quite user-friendly and intelligible. 

Of course, as anyone who has employed these methods will readily testify, 
there is nothing simple or determinate about the interpretive work that one 
must perform after having completed a structural analysis. In a sense, once one 
has mapped the meaning structure of a given institutional practice, one is in the 
same position as a scholar who has just emerged from an intensive field study, 
chock full of ideas and images. Any visual representation of a meaning struc- 
ture is still largely a Rorschach test upon which one must seek to project an in- 
terpretation. But, to the extent that one has carefully thought about the linkage 
between culture and practice and how it informs one's data selection, measures 
of differentiation, and structural modeling, so too one should have little diffi- 
culty in interpreting the meaning structures that have been identified. 


ABSTRACT


Research on social movements has usually addressed issues of movement 
emergence and mobilization, yet has paid less attention to their outcomes and 
consequences. Although there exists a considerable amount of work on this 
aspect, little systematic research has been done so far. Most existing work fo- 
cuses on political and policy outcomes of movements, whereas few studies 
address their broader cultural and institutional effects. Furthermore, we still 
know little about the indirect and unintended consequences produced by 
movements. Early studies have dealt with the effectiveness of disruptive and 
violent actions and with the role of several organizational variables for 
movement success. More recently, scholars have begun to analyze movement
outcomes in their political context by looking at the role of public opinion,
allies, and state structures. A comparative perspective promises to be a
fruitful avenue of research in this regard. 


INTRODUCTION 


If we trust our intuitions, the last big European cycle of protest caused such 
fundamental changes in the social and political structures that we are still won- 
dering about the kind of world we are now living in. In the eyes of a neutral ob- 
server, the democracy movements that shook Eastern Europe in 1989 were 
clearly instrumental in bringing about the new order. Mass actions and street
demonstrations in Czechoslovakia, East Germany, Hungary, Poland, and Romania
brought about the fall of the Communist regimes in those countries
and, together with popular mobilizations in the Baltic Republics later on, the 
collapse of the Soviet Union in 1991. That the movements must have played a 
significant role can be seen in the impressive growth of popular mobilizations 
in those countries. Take the example of East Germany. Oberschall (1996)
reports an impressive increase in the number of participants in protests and
demonstrations in Leipzig, where the key events took place during 1989. Whereas
the celebration of the anniversary of Karl Liebknecht and Rosa Luxemburg on 
January 15 saw the presence of 150—200 participants, the protest marches from 
Nikolai church to the center, which (starting from October 16) took place 
every Monday until Christmas, mobilized from 110,000 to 450,000 people. 
Yet even the most relentless optimists would concede that, without major 
changes in the structures of power, the protests and mass demonstrations 
would hardly have had such dramatic consequences. In fact, one can argue that
in the absence of such changes the movements themselves would not have taken on
such a large scale. Two major transformations in the states' structures gave a big
boost to the democracy movements in Eastern Europe and helped them change
our world: Gorbachev's perestroika and the cracks in the Communist states’ 
alliance system. Movement mobilization and state breakdown combined in a 
complex way to bring about a revolutionary outcome. 

Another example: During the summer of 1995 the Dutch oil company Shell 
announced plans to destroy the Brent Spar offshore oil rig located in the North 
Sea because it had become unusable. This decision provoked an immediate reaction
from outraged environmentalist groups, especially Greenpeace, which foresaw
an ecological disaster and called for a boycott of Shell products worldwide.
Many consumers took the boycott seriously and the company's sales
went down considerably in the days following the appeal. Particularly in Ger- 
many, drivers avoided Shell's gas stations in favor of other companies. Wor- 
ried by the fall of sales and the bad public image it was receiving, the oil com- 
pany abandoned the project of destroying the oil rig, thus conceding a signifi- 
cant victory to Greenpeace and the environmental movement. 

This example is very different from the previous one. For one thing, the 
events were much more limited in time, space, and scope. While the revolu- 
tions in Eastern Europe lasted several months (indeed, a very short time for a 
revolution), involved thousands of participants, and had dramatic social and 
political repercussions for the entire world, the Greenpeace boycott was called 
by a single organization and was successful within a few weeks, but this cer- 
tainly did not alter the foundations of contemporary society. Another differ- 
ence is that Greenpeace activists had seemingly expected—or at least 
hoped—that Shell would withdraw from its decision, whereas no one could 
have foreseen the fundamental changes brought about by the opposition to the
Communist regimes. Despite these differences, the two examples taken together
illustrate several problems and difficulties inherent in the study of the
consequences of social movements. The principal difficulty is how to establish 
a causal relationship between a series of events that we can reasonably classify 
as social movement actions and an observed change in society, be it minor or
fundamental, durable or temporary. Both our examples display social move- 
ment activities and were followed by changes that the movements had asked 
for, although the scope of those changes, in one case, went well beyond any 
possible anticipation. But the problem of causal attribution remains the same. 
Even for the apparently more obvious effect in the Brent Spar case, we cannot 
a priori exclude the intervention of a third party (a member of the political 
elite, for example) which may have caused the withdrawal of Shell's decision. 
In addition, both the protest cycle for democracy and Greenpeace's call for a 
boycott might have had a series of long-term consequences that neither the 
populations of Eastern Europe nor Greenpeace activists had planned. As I
hope will become clear by the end of the paper, these methodological problems
can only be resolved theoretically.

Social scientists often have a hard time finding consensus on many aspects 
of their collective enterprise. Students of social movements are certainly no 
exception to this rule. They often disagree on the causes of protest, its develop- 
ment over time, its fate, and the methods of analysis. Yet they all seem to agree 
that the study of the effects of social movements has largely been neglected, 
and it has become commonplace to note this state of affairs (e.g. Berkowitz
1974, Gurr 1980, McAdam et al 1988, Tarrow 1993). Such neglect is quite
astonishing, for the ultimate end of movements is to bring about change. The
field, however, is not as empty as several observers have maintained.1
Nevertheless, we still lack systematic empirical analyses that would add to our
knowledge of the conditions under which movements produce certain effects. 
Furthermore, a striking disparity exists between the large body of work on po- 
litical and policy outcomes and the sporadic studies on the cultural and institu- 
tional effects of social movements. This review reflects this state of affairs. 
(For previous reviews see Amenta et al 1992, Burstein et al 1995, Gurr 1980, 
Jenkins 1981, McAdam et al 1988, Mirowsky & Ross 1981, Schumaker 1978.) 
I first address the two main axes of early research: the moderation/disruption 


1The fact that, due to lack of space, I had to leave out a great many existing works is a clear
indication that there is a large body of literature on movement consequences. I have provided a
more exhaustive overview of the extant literature in another paper (Giugni 1994), on which the
present one is partly based. It should also be noted that studies of social revolutions, insofar as the
latter are the product of social movements or coalitions between movements and oppositional
elites, may be considered as the most dramatic effect of movements. Again, for space reasons I will
not deal with this aspect.
axis and the organization/disorder axis. Second, I review work that has attempted
to put movements and their outcomes in their larger social and political
context. Third, I point to some logical as well as methodological problems
of existing work that have prevented the cumulative gathering of systematic 
knowledge. In the end, I hope to be able to show that, while there exists a con- 
siderable amount of work on this topic, little systematic research has been 
done. This is especially true when it comes to comparisons across countries 
and across movements to specify the conditions that foster certain types of im- 
pact, an approach that I view as one of the most promising avenues for future 
research. 


THE POWER OF MOVEMENTS 


Most research so far has focused on the intended effects of social movements. 
Early work has looked in particular at the impact of movement-controlled
variables by attempting to single out the characteristics of movements that are
most conducive to success or, more generally, that help certain outcomes to 
occur. In this respect, one can discern two closely interrelated lines of investi- 
gation. The first line concerns the impact, mostly on policy, of various organ- 
izational variables and has brought researchers to ask whether strongly organ- 
ized movements are more successful than loosely organized movements. The 
second line of inquiry has looked at the effects of disruptive and violent protest 
behavior and has opened a debate in the literature about whether the use of dis- 
ruptive tactics by social movements is more likely to lead to policy changes 
than moderate tactics. This debate has largely dealt with the effectiveness of
violence. Let us briefly discuss each of these two aspects. 


The Impact of Organization 


Resource mobilization theory has dominated the study of social movements 
and contentious politics for at least three decades. It is therefore hardly
surprising that research on movement outcomes has paid a lot of attention to the role
of the organizational characteristics of movements. There is a fair amount of
theoretical and empirical work that links various movement-controlled
variables to their alleged impact. While early theoretical work has speculated over
the link between government responsiveness and the nature of movement
demands, organizational size and stability, leadership, and strategies (e.g. Etzioni
1970, Lipsky & Levi 1972), other authors have tried to show it empirically.
Brill's (1971) finding (based on a case study of rent strikes) that success is not 
likely to result if the movement leaders are unable to build an effective organi- 
zation is typical in this respect. Relevant work includes Shorter & Tilly's 
(1974) examination of the effect of organizational variables on the outcomes 
of strikes in France, Staggenborg's (1988) inquiry into the consequences of
professionalization and formalization in the pro-choice movement, and Cle- 
mens' (1993) investigation of the impact of organizational repertoires on insti- 
tutional change. We also have a substantial body of literature on the effects of 
lobbying strategies on governmental decisions and congressional action (e.g. 
Fowler & Shaiko 1987, Milbrath 1970, Metz 1986). However, these studies of- 
ten are more concerned with interest-group politics than social movements 
themselves. 

Important evidence about the relationship between various organizational 
variables and the success of social movements comes from Gamson's The 
Strategy of Social Protest (1990), which after more than two decades remains 
perhaps the most systematic attempt to inquire into the impact and effective- 
ness of social movements. The author's comprehensive analysis of the careers 
of 53 American challenging groups active between 1800 and 1945 led him to 
conclude that (a) groups with single-issue demands were more successful than 
groups with multiple-issue demands, (b) the use of selective incentives was
positively correlated with success, (c) the use of violence and generally disruptive
tactics was associated with success, while being the objects of violence
made it more difficult (as we will see in more detail below), and (d) successful
groups tended to be more bureaucratized, centralized, and unfactionalized, 
which is the most important point for the present purpose. Finally, he tested the 
role of context variables and found that times (quiet or turbulent) did not mat- 
ter much, whereas political crises seemed to have an effect on the outcomes of 
the challenging groups examined. 

Gamson's work has drawn a number of criticisms, mostly methodological
(Goldstone 1980, Gurr 1980, Snyder & Kelly 1976, Webb et al 1983, Zelditch
1978), and has also prompted a series of reanalyses of his data, which the author
had appended to the book (Frey et al 1992, Goldstone 1980, Mirowsky & Ross 1981,
Steedly & Foley 1979).2 As in the case of the role of disruptive tactics, most of
these works have confirmed Gamson's principal findings, at least in part. For 
example, Steedly & Foley (1979), using more sophisticated techniques, found 
group success related, in order of relative importance, to the nondisplacement
nature of the goals, the number of alliances, the absence of factionalism, the
existence of specific and limited goals, and the willingness to use sanctions.
Similarly, Mirowsky and Ross (1981), aiming at finding the locus of control
over movement success, found protester-controlled factors more important 
than the support of third parties or the situation for a successful outcome. Of 
these protester-controlled factors, the organization and, above all, the beliefs 
and goals were seen as crucial for success. More recently, Frey et al (1992) 


2These reanalyses have been included in the book's second edition (Gamson 1990).
pointed to the importance of not having displacement goals and group factionalism
to obtain new advantages. Thus, Gamson's central argument stressing
internal variables and resource mobilization as determinants of group success 
found further support. However, Piven and Cloward’s (1979) thesis that move- 
ments have a chance to succeed to the extent that they avoid building a strong 
organization brought a fundamental criticism to Gamson's stress on the effec- 
tiveness of organization, a criticism that has triggered a debate in both schol- 
arly and general audience journals. In addition, Goldstone's (1980) reanalysis 
of Gamson's data cast serious doubts over his findings and pointed to a per- 
spective on social movement outcomes that takes into account their broader 
political context. Before I return to this aspect, I would like to discuss the sec- 
ond main axis of existing research: the impact of disruption. 


The Effectiveness of Disruptive and Violent Protest 


Overall, the use by social movements of disruptive tactics and violence seems 
to increase their potential for change. Several authors have argued that, con- 
trary to the pluralist claim that moderation in politics is more effective than 
disruption, the use of force by social movements increases the chances that 
they reach their goals (Astin et al 1975, McAdam 1983, Tarrow 1994, Tilly et
al 1975). Again, Gamson's (1990) study provides empirical evidence of the
effectiveness of violence and the use of constraints. He found that the use of
violence and, more generally, disruptive tactics by challenging groups was
positively correlated to his two measures of success: the acceptance of challengers
as legitimate claimants and the obtaining of new advantages for constituents. 
These findings are backed up by some of the aforementioned reanalyses of his 
data, in particular those by Mirowsky & Ross (1981) and Steedly & Foley 
(1979). Yet there is no consensus on this point, nor on the implications of this 
for movements. 

Much evidence on the relationship between disruptive or violent movement 
tactics and their impact comes from two important strands of research: the 
study of strikes and the many analyses of the wave of urban riots that occurred 
in several American cities at the end of the sixties. As far as strike activity is 
concerned, Taft & Ross (1969), on the basis of a study of violent labor con- 
flicts in the United States through 1968, found little evidence that violence 
would help unions to reach their goals. A similar conclusion has been reached 
by Snyder and Kelly (1976). By analyzing quantitative data on strikes that oc- 
curred in Italy between 1878 and 1903, Snyder and Kelly were able to show 
that violent strikes were less successful than peaceful ones. These results con- 
tradict those obtained by Shorter & Tilly (1971) in their study of strikes in 
France, who found a positive correlation between the use of violence and strike 
outcomes. Research on strike activity, however, has gone beyond the specific 
question of disruption or violence to examine broader issues related to
industrial conflict (Cohn 1993, Franzosi 1994, Hicks & Swank 1984, Swank
1983, Shorter & Tilly 1974, Snyder & Kelly 1976). 

The effectiveness of disruptive protest and movements has been analyzed 
thoroughly in the aftermath of the urban riots of the 1960s in the United States 
(for reviews see Gurr 1980, Isaac & Kelly 1981, Piven & Cloward 1993). To 
be sure, rioting behavior and social movements are not equivalent, though they 
are both instances of contentious politics, defined as “collective activity on the 
part of claimants—or those who claim to represent them—relying at least in 
part on noninstitutionalized forms of interaction with elites, opponents, or the 
state” (Tarrow 1996:874). Social movements, on the other hand, may be defined
as “sustained challenges to powerholders in the name of a disadvantaged
population living under the jurisdiction or influence of those powerholders”
(Tarrow 1996:874; see also Tarrow 1994, Tilly 1984). However, studying riots 
can yield important insights into the effectiveness of disruption and violent
protest by social movements. In addition, the American riots of the sixties have
sparked interest in the latter aspect among students of social movements.
Some authors, including Hahn (1970), McClurg Mueller (1978), Isaac & Kelly 
(1981), Kelly & Snyder (1980), and Sears & McConahay (1973), have focused 
explicitly on the effects of violence. In general, the evidence gathered does not 
allow for a definitive answer to the question whether rioting is beneficial or 
detrimental to the population involved. Kelly & Snyder (1980), for example, 
suggested that there is no causal relationship between the frequency and sever- 
ity of violence displayed in American cities during the 1960s and the distribu- 
tion of black socioeconomic gains at the local level, either by income level or 
by employment and occupational changes. Feagin & Hahn (1973), in a mono- 
graph on ghetto riots, maintain that the latter led at best to limited reform and 
mostly to changes in police policies. Nevertheless, the authors did not provide 
systematic evidence for their argument. Berkowitz (1974), who looked at so- 
cioeconomic changes at the neighborhood level brought about by ghetto riots 
between 1960 and 1970, found no differential improvement for riot tracts, ar- 
guing against a positive effect of the riots (see also Levitan et al 1975). Even 
more pessimistically, Welch (1975) showed that the riots led to an increase in 
urban expenditures for control and punishment of rioters, and much less in 
their favor. However, Colby's (1975) findings in a way contradict Welch's, 
because he found that the riots had a positive influence on redistribution pol- 
icy, though no influence on regulatory policy at the state level. On the other 
hand, Jennings (1979), also through a comparison of states but over time as 
well, found some support for a positive correlation between the number of riots 
and the increase in AFDC recipients. 

Many studies of the urban riots in American cities are directly related to 
Piven & Cloward's (1993) well-known thesis about the regulating functions of 
public welfare (for reviews see Piven & Cloward 1993, Trattner 1983). As is well
known, these authors provocatively argued that welfare systems serve two
principal functions: to maintain a supply of low-wage labor and to restore or- 
der in periods of civil turmoil. According to this thesis, hence, turmoil and dis- 
ruptive actions do provoke policy change, but this can hardly be seen as suc- 
cess, for such concessions are usually withdrawn once the turmoil subsides. A 
series of studies carried out during the 1970s and 1980s attempted to reexam- 
ine this thesis (e.g. Albritton 1979, Betz 1974, Colby 1982, Hicks & Swank 
1983, Isaac & Kelly 1981, Jennings 1979, 1980, 1983, Schramm & Turbott 
1983, Sharp & Maynard-Moody 1991). In addition, other authors have ad- 
dressed Piven and Cloward's argument, but focusing on the relief expansion of 
the thirties (e.g. Jenkins & Brents 1989, Kerbo & Shaffer 1992; see further 
Valocchi 1990). Again, although much of the disagreement with Piven and
Cloward's thesis bears not so much on the results themselves as on their
interpretation, on the whole it is difficult to draw from this impressive amount
of empirical work a clear-cut answer to the question whether disruption
can produce policy changes and, if so, what this means for the movements.

Such uncertainty of results calls for a conditional analysis that singles out 
the circumstances under which violence matters. This task was accomplished 
by Button (1978), among others, in one of the most comprehensive empirical 
studies of the political impact of the 1960s riots. He maintained that violence is 
conducive to political and social change under five general conditions: 1. when 
powerholders have enough public resources to meet the demands of the movement,
2. when violent actions and events are not so frequent as to cause
massive societal and political instability yet are severe enough to be noticed and to
represent a threat, 3. when a relevant share of powerholders and the public are 
sympathetic to the goals of the movement and the violence is not so severe as 
to undermine this sympathy, 4. when the aims and demands of the movement 
are relatively limited, specific, and clear, and 5. when violence is adopted in 
combination with peaceful and conventional strategies (Button 1978). But- 
ton's approach has the advantage of avoiding the formulation of a too-simple 
causal relationship between the use of violence and its outcomes. On the other
hand, it seems so broad as to run the risk of leading to trivial results. A nar- 
rower argument in this respect has been put forth by Schumaker (1978), who 
has looked at the conditions under which disruptive tactics work. His results 
suggest that the use of constraints is more effective when the conflict is limited 
to the protest group and their target (i.e. when the scope of conflict is narrow).
In contrast, when the public becomes involved in the conflict (i.e. when the
scope of conflict is broad), the use of constraints tends to reduce the chances of 
a successful outcome. Other analyses based on the 1960s urban riots, however, 
suggest that militancy is generally not conducive to success (Schumaker 
1975). Similarly, a study of official responses to 60 protest incidents that oc- 
curred in the Philippines, Malaysia, and Thailand between 1960 and 1977 
showed that the use of violent constraints (i.e. militancy), except when the
group of protesters was large, had negative effects on the protest's outcomes 
because repression was more likely to occur (O'Keefe & Schumaker 1983). 


MOVEMENT OUTCOMES IN CONTEXT 


To summarize the paper so far, existing research on the impact of several inter- 
nal characteristics of social movements, such as the use of disruptive tactics 
and actions, seems to yield contradictory findings. Nevertheless, this contra- 
diction may well be more apparent than real. The puzzle may be solved once 
we acknowledge the crucial role of the broader political context in facilitating 
or constraining both the mobilization and the potential outcomes of movements.
Strategies that work in a given context may simply be ineffective in
other political settings and vice versa. Thus, more recent work has shifted 
away from the study of the effectiveness of disruption and the organizational 
characteristics of social movements toward the environmental conditions that 
channel their consequences. This has been done in two distinct directions. 
First, the role of public opinion in helping movements make an impact, or
preventing them from doing so, has been thoroughly investigated, particularly in
the United States. Second, a major turn in the study of movement outcomes has
come from comparative analyses that attempt to link them to the movements’
political context. Next I briefly consider these two avenues of research.


Public Opinion 


Social movements, particularly when they express themselves through their 
most typical form of action, public demonstrations, address their message si- 
multaneously to two distinct targets: the powerholders and the general public. 
On the one hand, they press the political authorities for recognition as well as 
to get their demands met, at least in part. On the other hand, they seek public
support and try to sensitize the population to their cause. At the same time, the 
most common political targets of contemporary movements, namely local or 
national governments, pay particular attention to public opinion and fluctuations
therein. All this makes a strong case for taking public opinion into account
as an important external factor in the study of the outcomes of social
movements. This has been done above all in the United States. Public opinion 
has entered the study of movement outcomes both as explanatory variable and 
explanandum. In the latter case, one examines how and to what extent movements
produce changes in the perceptions people have of a given issue (e.g.
Gusfield 1981, Lawson 1976, Oberschall 1973, Orfield 1975). However, while 
it seems rather obvious that protest activities raise the awareness of the popula- 
tion over certain political issues, changes in public opinion can also help 
movements to reach their goals by making decision-makers more responsive 
to their demands. Hence, several authors have stressed the role of public opinion
for legislative change (e.g. Burstein 1979a-c, 1985, Burstein & Freudenburg
1978, Costain & Majstorovic 1994, Page & Shapiro 1983, Weissberg
1976), though not always related to the impact of social movements. 

Paul Burstein is certainly among those who have paid most attention to this 
aspect. In his analysis of the struggle for equal employment opportunity in the 
United States, he showed that "equal employment opportunity legislation was 
adopted as the result of social changes that were manifested in public opinion,
crystallized in the civil rights and women's movements, and transformed into 
public policy by political leaders" (Burstein 1985:125), thus pointing to the in- 
terconnections of public opinion, movement activities, and congressional ac- 
tion in bringing about policy changes for discriminated-against groups. In 
similar fashion, Costain and Majstorovic (1994) studied the multiple origins of 
women's rights legislation by stressing the same three sets of factors. As they 
argue, there are several views of the relationship between public opinion and 
legislative action. They see four prevailing interpretations: 1. a public opinion 
interpretation, stating a direct relationship between public opinion and legisla- 
tive change, 2. an interpretation that sees public opinion as filtering the impact 
of outside events on legislative action, 3. an elite behavior interpretation, ac- 
cording to which public opinion is affected by legislative elites, and 4. a social 
movement interpretation, whereby legislation results from the joint action of 
social movements, public opinion, and media coverage. The latter appears as 
the most plausible interpretation, for not only does it take into account both 
movement actions and changes in public opinion, but it also acknowledges the 
fundamental role of the media for movement mobilization and outcomes. The 
way in which the media cover, frame, and interpret social movements has 
largely been neglected in the existing literature. Together with the analysis of 
the role of political opportunity structures for movement outcomes, this is a
promising avenue for future research (e.g. Gamson and Wolfsfeld 1993). 


Political Opportunity Structures 


As our initial example about the fall of the Communist regimes of Eastern 
Europe illustrates, and as Goldstone's (1980) reanalysis of Gamson's data 
made clear, the study of the outcomes of social movements cannot avoid tak- 
ing into account the political context in which they operate. On the basis of a 
series of methodological criticisms, Goldstone challenged both Gamson's 
main conclusions and his basic theoretical tenet. He found that the organiza- 
tional and tactical characteristics had no effect on group success. The timing of 
success, he maintained, is independent of the challengers' organization and 
tactics. What is most important, he suggested, is that the resource mobilization
model be replaced by a model that stresses the crucial role of broad, system- 
wide national crises for the success of social movements. We have a name for 
it: the political-process model. By looking at how external political factors affect
protest behavior, this approach also stresses the importance of the movements'
larger environment for their outcomes (e.g. Kitschelt 1986, Kriesi et al
1995, McAdam 1982, Rochon & Mazmanian 1993, Tarrow 1994). This, I 
think, is a clear theoretical advance and a way to follow. 

The central concept in the political process model is that of political opportunity
structure. In spite of various conceptualizations, two aspects appear to
be crucial for the understanding of the relation between social movements and 
their political environment: the system of alliances and oppositions and the 
structure of the state. The importance of having powerful allies both within and 
without the institutional arena has been stressed on several occasions. Early 
work focused in particular on the context of social support and conceived of al- 
liances as a political resource that movements can use to become more suc- 
cessful, since movements were considered as powerless challengers. One of 
the first systematic statements in this respect was made by Lipsky (1968; see 
also Lipsky 1970, Lipsky & Olson 1977), who saw movements as strongly de- 
pendent on the activation of third parties to be successful in the long run. Schu- 
maker (1975) arrived at a similar conclusion in his study of the responsiveness 
of political authorities to racial riots. On the other hand, third parties also in- 
clude opponents, which might influence the oversimplified relationship be- 
tween movements and the state and either prevent or facilitate their outcomes. 
Yet few authors have looked at the role of opponents (e.g. Barkan 1984, Jasper 
& Poulsen 1993, McAdam 1982, Turk & Zucker 1984). Following this per- 
spective, the effectiveness of social movements depends on their capability to 
engage in bargaining activities with allies and opponents (Burstein et al 1995). 

The importance of political resources and institutions for movement out- 
comes has also been stressed by Jenkins & Perrow (1977), who have suggested 
a link between changes in the political environment that offer social resources, 
on the one hand, and the rise and success of farm-worker insurgents, on the 
other hand. The conducive environment in their study is represented by the 
government and a coalition of liberal support organizations. Ultimately, they 
argue, the success of powerless insurgents is due to a combination of sustained
outside support, the disunity of the political elites, and their tolerance, which 
provided the movement with crucial resources. Similarly, Piven and Cloward 
(1979) point to the important constraining role of institutions, which shape op- 
portunities for action, model its forms, and limit its impact. They maintain that 
protest is more likely to have a real impact when challengers have a central role 
in institutions and when powerful allies have a stake in those institutions. Spe- 
cifically, they view the electoral-representative system as a major factor medi- 
ating the political impact of institutional disruptions. 

In line with this emphasis on political institutions, more recent work has begun
to follow what I see as the most promising avenue of research on the outcomes
of social movements: to carry out cross-national comparisons of movements
and to examine one or more instances of a varied array of their potential
consequences in order to formulate plausible causal theories about the link
between movement actions and those consequences. In so doing, one can as- 
sess the filtering role of the political context on movement outcomes. Follow- 
ing this perspective, Amenta et al (1992) have shown for the case of early so- 
cial policy in the United States that the political mediation model, which places 
political opportunity structure as a mediating factor between social move- 
ments and their success, offers the best explanation. Ultimately, therefore, the 
state and the political party system determine whether social movements can 
win acceptance and new advantages. 

Although attempts at comparing movement outcomes across countries are 
not new (e.g. Gurr 1983, Kitschelt 1986, Kowalewski & Schumaker 1981, 
Midttun & Rucht 1994, Rüdig 1990), there is still a huge void 1n the literature 
as opposed to case studies of single movements or countries. The best known 
of these cross-national studies is probably that of Kitschelt. In his influential 
comparison of the antinuclear movement in four western democracies (Kit- 
schelt 1986), he makes a strong case for the structural determinants of social 
movement outcomes, arguing that the success of the antinuclear movement is 
strongly dependent on political opportunity structures. A more recent contri- 
bution elaborates on Kitschelt’s model to show the crucial role of political op- 
portunities in shaping the outcomes of Western European new social move- 
ments (Giugni 1995). Hopefully, other scholars will soon join these efforts and 
carry on comparative studies on the outcomes and consequences of social 
movements. 


SUCCESS, FAILURE, OUTCOMES, CONSEQUENCES 


At this point, there is need for clarification of certain terms of our discussion. 
So far, we have seen that a first strand of research has inquired into the internal 
and organizational characteristics of social movements that may help them to 
bring about (policy) outcomes and hence to become successful. A second 
strand has tried to put the movements in their larger social and political envi- 
ronment, in particular by examining the role of public opinion and political op- 
portunity structures as intervening factors mediating the movement-outcome 
nexus. To do so, scholars have relied on various typologies of outcomes. The 
best known is certainly the one proposed by Gamson (1990), who has defined 
success as a set of outcomes that fall into two basic clusters: the acceptance of a 
challenging group by its antagonists as a valid spokesman for a legitimate set 
of interests, and the gain of new advantages by the group's beneficiary during 
the challenge and its aftermath. By combining these two dimensions, the 
author has defined four possible outcomes of a challenge: 1. full response, 2. 
preemption, 3. co-optation, and 4. collapse. Unfortunately, this typology is not 
fully exploited in the empirical analyses, which remain for the most part con- 
fined to the two-fold distinction between acceptance and new advantages. I 
have lingered on Gamson’s main findings earlier. What matters here is to see 
how his simple typology has influenced much subsequent research. In some 
way, on the other hand, it has also put some limits to research, for it brought the 
focus on the organizations instead of on the broader cycles of protest, which 
may include various movements whose combined effect might be more impor- 
tant than the impact of a single challenging group (Tarrow 1994). 
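
The cross-classification behind Gamson's four outcomes can be written out explicitly. What follows is only a minimal sketch in Python: the function name and the boolean encoding of the two dimensions are illustrative assumptions, and the assignment of labels to cells follows the usual reading of the typology (preemption as new advantages without acceptance, co-optation as acceptance without new advantages).

```python
def gamson_outcome(acceptance: bool, new_advantages: bool) -> str:
    """Return the outcome label for one cell of the two-by-two typology.

    Hypothetical helper for illustration only: the two arguments encode
    the two dimensions named in the text (acceptance, new advantages).
    """
    if acceptance and new_advantages:
        return "full response"
    if new_advantages:
        return "preemption"    # new advantages without acceptance
    if acceptance:
        return "co-optation"   # acceptance without new advantages
    return "collapse"          # neither acceptance nor new advantages


if __name__ == "__main__":
    # Enumerate all four cells of the typology.
    for acc in (True, False):
        for adv in (True, False):
            print(f"acceptance={acc}, new_advantages={adv}: "
                  f"{gamson_outcome(acc, adv)}")
```

Treating each dimension as a simple yes/no is itself a simplification, since, as noted below, success on either dimension comes in degrees.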

Several authors have adopted the distinction between acceptance and new 
advantages or have given a revised version of it. Among the former are obvi- 
ously those who have reanalyzed Gamson's original data (Frey et al 1992, 
Goldstone 1980, Mirowsky & Ross 1981, Steedly & Foley 1979). Webb and 
several collaborators also built on Gamson's typology and work, but used a 
different dataset (Webb et al 1983). Amenta et al (1992), on the other hand, de- 
fined three levels of success in an attempt to elaborate on Gamson's typology: 
co-optation or the recognition from opponents or the state, gains in policies 
that aid the group, and the transformation of challengers into a member of the 
polity. Within each type, in addition, there are various degrees of success. 
Here, however, we begin to see the dangers entailed in the use of the notions of 
success and failure. First, such a perspective assumes that social movements 
are homogeneous and hence tends to attribute success or failure to an entire 
movement, unless one focuses on single organizations as Gamson did. Yet of- 
ten there is little agreement among movement leaders and participants, even 
within a given organization, regarding which goal must be pursued. Second, as 
it is not always uniformly evaluated by everyone, success raises the question of 
subjectivity. Movement participants and external observers may have differ- 
ent perceptions of what counts as success, and the same action may be judged 
as successful by some participants and as failed by others. Finally, the notion 
of success is problematic because it overstates the intention of participants. 
Once again, while social movements are rational efforts to bring about change, 
many of their consequences are unintended and often unrelated to their claims. 

These ambiguities notwithstanding, numerous scholars have looked at the 
determinants of movement success or failure (e.g. Amenta et al 1992, Banas- 
zak 1996, Brill 1971, Burstein et al 1995, Frey et al 1992, Gamson 1990, Gold- 
stone 1980, Mirowsky & Ross 1981, Nichols 1987, Perrot 1987, Piven & 
Cloward 1979, Shorter & Tilly 1971, Steedly & Foley 1979). Therefore, most 
of the existing typologies are framed, explicitly or implicitly, in terms of suc- 
cess. 

Here are only a few examples: (a) Rochon & Mazmanian (1993) added a 
third type of impact to Gamson’s distinction, thus defining three arenas of 
movement success: policy changes (new advantages in Gamson’s terminol- 
ogy), changes in the policy process (Gamson’s acceptance), and changes in so- 
cial values. (b) Drawing both from the social-problems literature and the 
public-policy literature, Schumaker (1975) defined five criteria of government 
responsiveness to movement demands: access, agenda, policy, output, and im- 
pact. (c) Rüdig (1990) used this typology in his comprehensive study of the an- 
tinuclear movement worldwide. (d) Burstein et al (1995) also relied on this ty- 
pology, pointing out correctly that it addresses several aspects of the political 
process that had previously been left out. However, they added structural ef- 
fects as a sixth type of government responsiveness, thus acknowledging that 
movements can provoke alterations in the institutional arrangements of society. 
(e) Kitschelt (1986) also stressed structural effects, i.e. a transformation of the 
political structures, in addition to procedural effects (Gamson's acceptance) 
and substantive effects (Gamson's new advantages). This typology allows for 
a link between the outcomes of social movements and their political context. 
(f) In quite a similar way, Gurr (1980) had previously defined three types of 
outcomes of violent conflicts: effects on the group fate, policy changes, and 
societal or systemic effects. The advantage of this typology is that it makes a 
clear distinction between internal effects on the movement and external effects 
on policy or the larger society. (g) Kriesi (1995) added a further distinction to 
Kitschelt's typology by defining two types of substantive impact: reactive ef- 
fects, i.e. the prevention of “new disadvantages,” and proactive effects, i.e. the 
introduction of “new advantages.” This distinction is relevant with regard to 
political opportunity structures, for it allows us to link social movement out- 
comes to the strength of the state and has been used to investigate the outcomes 
of Western European new social movements (Giugni 1995). (h) Finally, Rucht 
(1992) acknowledged the need to distinguish between goal-related outcomes 
and broader consequences by classifying the effects of social movements ac- 
cording to two dimensions: internal vs external and intended vs unintended. 
Gurr's and Rochon & Mazmanian's typologies present a further advantage: 
They acknowledge the possibility that different types of outcomes be related to 
each other. This is an important point. Gurr (1980), for example, suggested 
that group changes and systemic changes be seen as ultimate outcomes that 
take place through policy changes, which, in turn, are the proximate result of 
violent conflicts (Gurr 1980). Rochon & Mazmanian (1993) maintain that sub- 
stantial gains may be more easily obtained once a challenging group has 
reached some degree of acceptance. Other authors have similarly explored 
how social movements can make a greater impact by pursuing goals in admin- 
istrative agencies and courts once they have achieved policy responsiveness 
(e.g. Burstein 1985, 1991, Handler 1978, Sabatier 1975). A recent interesting 
variant has been proposed by Diani (1997), who claims that when movements 
are able to facilitate the emergence of new social networks they will be more 
influential in processes of political and cultural change. Here, we abandon the 
classificatory terrain to begin to reason in terms of relationships between vari- 
ables. In other words, it is the beginning of a theory of movement outcomes. 
Unfortunately, very little research has been done to show how a certain type of 
impact can help to bring about another type. In this, however, we have another 
interesting avenue for future research. 

Several authors have stressed the methodological problems that have been 
preventing social scientists from systematically analyzing the consequences of 
the presence and action of social movements, including the problem of causal 
attribution, the problem of time reference and effect stability, the problem of 
movement goal adaptation, the problem of interrelated effects, and the prob- 
lem of unintended and perverse effects (Rucht 1992; see further Giugni 1994, 
Gurr 1980, Snyder & Kelly 1979). Although this is not the place to propose so- 
lutions to these and related methodological problems, it would perhaps help to 
point out a logical puzzle that lies upstream, the recognition of which would make 
the task of setting a research agenda easier. It has to do with the blurring of some 
fundamental distinctions between types of potential effects of movements. 
The vast majority of the existing studies deal with effects that are related to the 
movements' stated programs and ends. The Brent Spar case mentioned at the 
outset is a good example: A declared goal by a challenging group is reached, 
allegedly as a result (at least in part) of the group's actions. But only under ex- 
ceptional circumstances do movement actions have such an immediate and 
successful impact. Most of the time, movements promote their programs cu- 
mulatively over months and even years of claim-making (C Tilly, 1998a). This 
makes the analysis much more complicated. Yet most research has focused on 
outcomes of social movements, which we may define as a special case of the 
more general set of their consequences: those that relate directly to the goals 
and ends of challengers. 

Even more narrowly, work on outcomes has usually looked at the impact of 
movements on government policy or legislation (e.g. Amenta et al 1992; 
Banaszak 1996; Burstein 1979a, 1985; Burstein & Freudenburg 1978; Button 
1978, 1989; Costain & Majstorovic 1994; Gelb 1989; Gelb and Palley 1987; 
Huberts 1989; MacDougal et al 1995). Three only partly correct assumptions 
are perhaps at the origin of this strong focus on policy outcomes. First, the 
view held by the political process approach that social movements are essen- 
tially targeting political authorities and institutions and, hence, they are mainly 
aimed at provoking political change. While such a definition covers a crucial 
aspect of the national social movement and is widely adopted in the literature 
(McAdam et al 1996, Tarrow 1994, Tilly 1984), contemporary movements of- 
ten address the larger public, aiming, for example, to change attitudes and 
opinions on a given matter. In addition, other authors have warned us about the 
dangers of restricting our attention to the political side of new social move- 
ments, as they have identity-related goals that do not necessarily require a po- 
litical target (Melucci 1996). Second, and related to the first point, the eager- 
ness to find the causes of movement success or failure, an attitude facilitated 
by the activist past of many scholars and by a sympathetic stand toward many 
contemporary movements. Third, the conviction that policy changes are more 
easily measured than cultural changes. The latter reason would explain why 
we still have rather few studies on the cultural aspects of movements except for 
the individual-level consequences of participation in social movements and 
activism, on which there is a considerable body of literature (e.g. Abramowitz 
& Nassi 1981, Demerath et al 1971, Fendrich 1974, 1977, Fendrich & Krauss 
1978, Fendrich & Lovoy 1988, Fendrich & Tarlau 1973, Jennings 1987, Jen- 
nings & Niemi 1981, Marwell et al 1987, McAdam 1988, 1989, 1998a, Nassi 
& Abramowitz 1979, Whalen & Flacks 1980). 

To be sure, there is work on what may be seen as instances of the cultural 
impact of movements, such as their spillover effects from one movement to the 
other (Meyer & Whittier 1994), their capacity to generate social capital (Di- 
ani 1997), their impact on the media (Gamson & Wolfsfeld 1993), and so forth, 
but these are rather sporadic in comparison to the large body of work on 
policy outcomes. Other authors, on the other hand, have looked at the cultural 
determinants of movement success as measured through policy or legislative 
change (e.g. Banaszak 1996), thus reversing the causal arrow. 

Studying the ways in which social movements have their demands met is, of 
course, a legitimate endeavor that will help improve our knowledge of the 
causal processes involved in social and political change. Yet, like all kinds of 
actions, the effects of social movements are often indirect, unintended, and 
sometimes even in contradiction to their goals (on the unintended conse- 
quences of social action, see Tilly 1996). Increased repression, for example, is 
often an immediate effect of protest, but the long-term consequences may be 
different (della Porta 1995). Tarrow (1989, 1993, 1994) goes precisely in this 
direction when he looks at the broad repercussions of cycles of protest, includ- 
ing cycles of reform. In his study of the Italian protest cycle of the 1960s and 
1970s (Tarrow 1989), the author shows that this period of disorder made a cru- 
cial impact and left a positive legacy for Italian democracy by promoting re- 
form, expanding the political arena, giving autonomy to Italian voters, and, 
above all, expanding the repertoire of the legitimate forms of political partici- 
pation. By analyzing social movements at the macro level, Tarrow established 
a link between two broad phenomena: the emergence, development, and de- 
cline of a cycle of protest, on the one hand, and political, institutional, and cul- 
tural changes, on the other hand, whereby the former plays a crucial role in 
bringing about the latter. The lesson to be drawn here is that both the short- 
term and the long-term consequences of movement actions must be examined 
(Andrews 1997). 

Empirical work that focuses explicitly on the unintended consequences of so- 
cial movements is quite rare (e.g. Deng 1997, Paul et al 1997). Yet, as Charles 
Tilly (1998a) has put it, “this range of effects far surpasses the explicit de- 
mands made by activists in the course of social movements, and sometimes ne- 
gates them. By any standard, ‘success’ and ‘failure’ hardly describe most of 
the effects.” In addition, he maintains, third parties can act and produce 
changes in the zone of a movement's activities and interests. According to 
Tilly, the difficulties of analyzing the consequences of social movements arise 
precisely from this logical situation, which he has schematized as three over- 
lapping circles. Analysts should take into consideration three sets of variables: 
1. all movement claims, 2. all effects of movements’ actions, and 3. all effects 
of outside events and actions. The overlapping of these three variables creates 
four situations that must be analytically distinguished. As Figure 1 shows, 
what I defined as outcomes, i.e. effects of movement actions that bear directly 
on movement claims, result from the overlapping of sets 1 and 2. If the effects 
can be completely attributed to the movement's action, we can speak of suc- 
cess when they are positive and failure when they are negative (intersection 
A), although the problem of the differential evaluation of success remains. But 
at least some outcomes are produced as joint effects of movement actions 
and outside influences (intersection B). Furthermore, sometimes external 
events and actions may produce effects that satisfy movement claims (inter- 
section C). Finally, we must take into account the possibility of joint effects of 
movement actions and outside influences that do not bear directly on move- 
ment claims, i.e. unintended consequences (intersection D). Once we have posed the funda- 
mental logical problem so nicely illustrated by Tilly, we will be in a better po- 
sition to build causal theories about social movements, their success or failure, 
their outcomes, and the broader consequences of their actions. 
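
The logic of the overlapping sets can be made concrete with a small classification routine. This is a sketch under stated assumptions rather than anything proposed by Tilly: the function name and the three yes/no attributes are hypothetical, and the letters A to D follow the legend of Figure 1. Combinations that the figure does not label (for example, an effect of outside events alone that does not bear on movement claims) return no label.

```python
from typing import Optional


def classify_effect(from_movement: bool, from_outside: bool,
                    bears_on_claims: bool) -> Optional[str]:
    """Assign an effect to one of the four labeled regions, or None.

    Hypothetical helper: each argument records membership in one of the
    three sets (effects of movement actions, effects of outside events
    and actions, things that bear on movement claims).
    """
    if bears_on_claims:
        if from_movement and from_outside:
            return "B"  # joint effects that bear on claims
        if from_movement:
            return "A"  # effects of movement actions alone
        if from_outside:
            return "C"  # effects of outside influences alone
        return None     # a claim with no corresponding effect
    if from_movement and from_outside:
        return "D"      # joint effects that do not bear on claims
    return None         # region not labeled in the figure


if __name__ == "__main__":
    print(classify_effect(True, False, True))   # A
    print(classify_effect(True, True, False))   # D
```

Only region A supports a straightforward language of success or failure; the other labeled regions all involve outside influences, which is where the problem of causal attribution arises.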


CONCLUSION 


As the review of the relevant literature reveals, much work on the impact of 
social movements and protest behavior was done during the seventies. The 
spark was provided by the wave of student and antiwar protest as well as the ri- 
ots that occurred in American cities during the sixties. The latter, in particular, 
incited American scholars to inquire not only into the causes, but also the con- 
sequences of violent political behavior. European scholars, on the other hand, 
have usually privileged the broad processes that have led to the emergence of 
the new social movements, hence paying only little attention to their repercus- 
sions on society, especially in empirical research. Subsequently, the interest in 
the effects of movements has somewhat waned. It resurfaced recently, how- 
ever. Two forthcoming collective volumes (M Giugni et al 1998a,b), together with re- 
cently published works and ongoing studies, testify to this renewed interest in 
the consequences of social movements, which stems less from the need to un- 
derstand current practices in society, such as riot behavior in urban settings, 
than from the willingness to fill an important gap in the social movement literature. 


A = EFFECTS OF MOVEMENT ACTIONS THAT BEAR DIRECTLY ON MOVEMENT CLAIMS 

B = JOINT EFFECTS OF MOVEMENT ACTIONS AND OUTSIDE INFLUENCES THAT BEAR 
DIRECTLY ON MOVEMENT CLAIMS 

C = EFFECTS OF OUTSIDE INFLUENCES (BUT NOT OF MOVEMENT ACTIONS) THAT BEAR 
DIRECTLY ON MOVEMENT CLAIMS 

D = JOINT EFFECTS OF MOVEMENT ACTIONS AND OUTSIDE INFLUENCES THAT DON’T BEAR 
ON MOVEMENT CLAIMS 

Source: Tilly (1998) 

Figure 1 The problem of identifying social movement outcomes 


As such, it is less focused on those characteristics and features shown 
by the phenomena currently under way and more genuinely aimed at unveiling 
the processes and dynamics that allow movements to make an impact on dif- 
ferent aspects of society. This alone gives us some reassurances that more at- 
tention will be paid in the future to crucial consequences of social movements 
previously neglected. I am referring in particular to their potential for influenc- 
ing processes of broader cultural and institutional change. 

An agenda for future research should focus on the comparative study of the 
outcomes and consequences of social movements. Comparisons between dif- 
ferent political contexts, different movements, and different periods will shed 
light on the causal dynamics involved in processes of social and political 
change. A promising way to do so is to adopt a historical comparative design 
aimed at analyzing concordances and differences in order to generate explana- 
tions. Specifically, we would have much to gain from conducting in-depth 
comparisons of different national cases and different movements over a rela- 
tively long period, thereby comparing interactions that allow distinct move- 
ments to have a given type of consequence in different countries. By analyzing 
movement consequences following a comparative design, in addition, we will 
be able to avoid the formulation of invariant models that serve the needs 
of the social sciences so badly (Tilly 1995). In addition, as Tilly (1998a) has correctly 
put it, the study of the outcomes and consequences of social movements im- 
plies, and indeed requires, the analysis of movement interactions and dynam- 
ics. If we do not pay careful attention to such interactions and dynamics, the 
methodological problems I have pointed out will always render our analyses 
weak and our conclusions shaky. If we do not first clarify the dynamics that 
have led hundreds of thousands of people to challenge the Communist regimes 
in Eastern Europe, we will hardly be able to establish whether those protests 
were instrumental in the dramatic changes that occurred and how. Similarly, if 
we do not first shed light on the interactions between Greenpeace activists, po- 
litical elites and institutions, public opinion, and Shell’s leaders, we will find it 
difficult to attribute the company's decision to destroy the Brent Spar oil rig to 
the environmentalists’ outraged call for a boycott. After all, without interac- 
tions there are simply no outcomes or consequences. 


ABSTRACT 


People have a tendency to marry within their social group or to marry a per- 
son who is close to them in status. Although many characteristics play a role 
in the choice of a spouse, sociologists have most often examined endogamy 
and homogamy with respect to race/ethnicity, religion, and socioeconomic 
status. I first give an overview of hypotheses on the causes of endogamy and 
homogamy. The various hypotheses that have been suggested in the litera- 
ture can be distinguished as arguments about three more general factors: (a) 
the preferences of marriage candidates for certain characteristics in a spouse, 
(b) the interference of “third parties” in the selection process, and (c) the con- 
straints of the marriage market in which candidates are searching for a 
spouse. Second, I summarize empirical research by answering four ques- 
tions: (a) To what extent are groups endogamous and how do groups differ in 
this respect? (b) How has endogamy changed over time? (c) Which factors 
are related to endogamy? (d) How do various dimensions of partner choice 
coincide? Third, I discuss strengths and weaknesses of past research. Strengths 
include the mass of descriptive work that has been done and the development 
of a multifaceted theoretical perspective which gives sociological theorizing 
an edge over psychological and economic theories of partner choice. Weak- 
nesses include the lack of standardization of methods in describing patterns 
and trends and the relatively weak integration of empirical and theoretical 
work. 


INTRODUCTION 


Since the beginning of this century, sociologists have described patterns of part- 
ner choice and have tried to explain why people marry within their group (en- 
dogamy) and why people marry persons close in status (homogamy). The re- 
search literature can be divided into three traditions, depending on which type 
of characteristic is considered. Research on ethnic and racial intermarriage 
originated in immigrant countries such as the United States and is motivated 
by the question of whether the various nationality groups would integrate with 
one another and with the original population (Drachsler 1920; Wirth & Gold- 
hamer 1944). Research on religious intermarriage has been done both in and 
outside the United States and has been concerned with the extent to which 
churches control the life choices of their members and the degree to which re- 
ligious involvement translates into the membership of “communal groups” 
(Kennedy 1944). Research on socioeconomic homogamy was developed by 
stratification researchers who used marriage patterns in conjunction with mo- 
bility patterns to describe how open stratification systems are (Glass 1954). 

Although the underlying issues are diverse, one common theme is that all 
traditions characterize social differentiation by describing patterns of social 
interaction. Building on the Weberian notion of status group closure, students 
have argued that interaction between social groups provides a fundamental 
way to describe the group boundaries that make up the social structure. Be- 
cause marriage is an intimate and often long-term relationship, intermarriage 
or heterogamy not only reveals the existence of interaction across group 
boundaries, it also shows that members of different groups accept each other as 
social equals. Intermarriage can thus be regarded as an intimate link between 
social groups; conversely, endogamy or homogamy can be regarded as a form 
of group closure. 

Another common theme lies in the consequences of intermarriage. First, in- 
termarriage decreases the salience of cultural distinctions in future generations 
because the children of mixed marriages are less likely to identify themselves 
with a single group. Although mixed couples may socialize their children into 
the culture of a single group, these children are less likely to identify with that 
group when intermarriage in society is common. Second, by intermarrying, in- 
dividuals may lose the negative attitudes they have toward other groups. Al- 
though personal interaction between groups sometimes fosters conflicts by 
making economic and cultural differences more apparent, if the relationship is 
intimate, interaction gives people an opportunity to realize the individual vari- 
ety among the members of another group and, in doing so, may ultimately 
weaken their prejudices and stereotypes. Because intermarriage often con- 
nects the social networks of the two spouses, this applies to a range of outgroup 
members and not just to the immediate partners. 

In short, what makes intermarriage sociologically relevant lies in its inher- 
ent dynamic: It is not just a reflection of the boundaries that currently separate 
groups in society, it also bears the potential of cultural and socioeconomic 
change. While marriage patterns are in this sense telling social indicators, they 
do not tell us everything. First, if members of two groups do not marry one an- 
other, it does not necessarily mean that both groups are closed. It takes two to 
marry, and if one group is closed while the other is open, endogamy may still 
prevail. Research on marriage is less informative in this respect than, for in- 
stance, research on individual racial prejudice. In a similar vein, homogamy 
tells a somewhat ambiguous story about the preferences and prejudices of 
status groups. Homogamy will occur if people prefer to marry into high-status 
groups, but it will also occur when people prefer to marry status-equals. In 
high-status groups, preferences for high-status spouses and preferences for 
status-equals are similar, but in lower-status groups, these are different. 

Second, marriage patterns result from both preference and opportunity. Op- 
portunity to marry within the group depends on many factors, such as residen- 
tial segregation, the composition of local marriage markets, group size, and so 
on. As a result, endogamy does not necessarily point to a personally felt social 
distance toward a certain outgroup. Such preferences play a role, but to what 
extent they determine the actual choices people make is an empirical question. 
Marriage patterns simply tell us which groups interact with whom, and while 
this is an important piece of information, they do not tell us why. 

A third and final limitation of marriage patterns lies in demographic trends. 
Declining marriage rates, the rise of cohabitation, and the increase in divorce 
suggest that it is not always valid to treat marriage patterns as indicators of dif- 
ferentiation in society as a whole. Some of these problems can be solved more 
easily than others. The rise of cohabitation poses no real problem because one 
can often include cohabiting couples in the analysis. Declining marriage rates 
are also less of a problem because they are largely the result of marriage de- 
lays; the vast majority of a given birth cohort eventually marries. The rise of 
divorce is more problematic, because intermarriage and divorce are often posi- 
tively related. A high rate of ethnic intermarriage may point to open social 
groups, but if mixed marriages are more likely to break up, such a conclusion 
would need further study. 

In the past decades, researchers have described patterns of intermarriage, 
examined individual variations in intermarriage, and assessed changes in in- 
termarriage over time. In addition, both theoretical and empirical studies have 
developed hypotheses about why people marry within their group and why 
some do while others do not. Because such hypotheses are often not tested di- 
rectly, I divide my review into a theoretical and an empirical section. The goal 
of the theoretical section is to review micro- and macro-level hypotheses about 
the causes of intermarriage and homogamy and to put these into a general theo- 
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retical framework. The goal of the empirical section is to summarize patterns, 
variations, and trends in intermarriage. I focus on the three main sociological 
group characteristics (i.e. race and ethnicity, religion, and socioeconomic 
status), I limit myself to Western societies, and I discuss studies conducted in 
the last decade. 


THEORETICAL WORK ON INTERMARRIAGE AND 
HOMOGAMY 


Marriage patterns arise from the interplay between three social forces: the 
preferences of individuals for certain characteristics in a spouse, the influence 
of the social group of which they are members, and the constraints of the mar- 
riage market in which they are searching for a spouse (Kalmijn 1991b). Al- 
though these factors represent analytically distinct hypotheses, they have most 
often been regarded as complementary elements of a single theory, and that is 
what distinguishes the sociological perspective from economic or psychologi- 
cal theories on partner choice (e.g. Winch 1958). 


Preferences of Marriage Candidates 


To understand aggregate patterns of marriage selection, researchers use the 
concept of a marriage market. Unmarried men and women operate within a 
marriage market where each individual considers a set of potential spouses. 
Potential spouses are evaluated on the basis of the resources they have to offer 
and individuals compete with each other for the spouse they want most by of- 
fering their own resources in return. Several kinds of resources obviously play 
a role in the choice of a spouse, but sociologists have mostly focused on socio- 
economic and cultural resources. When married, spouses pool these resources 
to produce family goods, such as economic well-being, status, social confir- 
mation, and affection. 


SOCIOECONOMIC RESOURCES Socioeconomic resources are defined as re- 
sources that produce economic well-being and status. Economic well-being is 
shared by the family members and status is granted to the family as a unit 
rather than to its individual members. As a result, the income and status of one 
spouse contribute to the income and status of the other by raising the income 
and status of the family. People maximize their income and status by searching 
for a spouse with attractive socioeconomic resources. The outcome of this 
competition is that the most attractive candidates select among themselves 
while the least attractive candidates have to rely on one another. Competition 
for socioeconomic resources on the marriage market thus leads to an aggregate 
pattern of homogamy. 

The nature of this competition varies with the role women play in society. 
When marriage is based on the benefits that stem from the division of paid and
domestic labor in the household, prevailing gender differences in earnings
give men a comparative advantage in productive labor so that the wife's time is
used more productively when it is spent on household labor. As a result, men
and women exchange paid and domestic labor resources. Similar arguments 
have been made with regard to status and prestige. When the status of the fam- 
ily depends primarily on the occupation of the husband, there will be an ex- 
change of male prestige and female qualities in other respects, such as class 
background, physical attractiveness, and cultural participation (Jacobs & Fur- 
stenberg 1986; Stevens et al 1990; Uunk 1996). 

Both types of exchange suggest that men, unlike women, do not compete 
among themselves for female socioeconomic resources in the marriage mar- 
ket. There are good reasons to believe that this has changed. An increasing 
number of married women participate in the labor market and married 
women's work is now less often motivated by temporary economic needs of 
the family. Several authors believe that these changes have made women's
socioeconomic resources increasingly attractive to men. The wife's human
capital may facilitate the husband's access to networks that are helpful in his
career, her earnings may subsidize his human capital investments, and the
economic security she provides may lessen his need to settle for short-term career
benefits, thus increasing his opportunity to choose more attractive, long-term 
career objectives. Because female labor is now often the reflection of women's 
desire to work outside the home, rather than a reflection of the economic needs 
of the family, the wife's socioeconomic resources may also become increas- 
ingly important for the status of the family (Davis 1984). 


CULTURAL RESOURCES While the importance of socioeconomic resources is 
based on a preference to marry a resourceful spouse, independent of one's own 
resources, the role of cultural resources is based on a preference to marry
someone who is similar. Preferences for cultural similarity have been ad- 
dressed most extensively in the social psychological literature on personal at- 
traction (Byrne 1971). Similarity of values and opinions leads to mutual con- 
firmation of each other's behavior and worldviews, similarity of taste is attrac- 
tive because it enlarges opportunities to participate in joint activities, and simi- 
larity of knowledge creates a common basis for conversation, which enhances 
mutual understanding. 

Although originally developed to explain attraction between strangers in 
day-to-day interaction, these notions have also been applied to marriage
(DiMaggio & Mohr 1985; Kalmijn 1994). Because cultural similarity leads to
personal attraction, it is a prerequisite for getting involved with someone. Because
of its instrumental effects, cultural similarity also encourages people to estab- 
lish a long-term relationship. Since many activities in marriage are joint, such 
as the raising of children, the purchase of a house and other consumer durables,
and the spending of leisure time, dissimilarity in taste would complicate these
shared activities. More generally, people prefer to marry someone who has 
similar cultural resources because this enables them to develop a common life- 
style in marriage that produces social confirmation and affection. 


PREFERENCES AND HOMOGAMY Preferences for socioeconomic and cultural 
resources do not by themselves translate into homogamy and endogamy with 
respect to social characteristics. Some authors argue that social characteristics 
are correlated with such resources, and that homogamy or endogamy is the un- 
intended by-product of individual preferences for resources in a partner. This 
argument has often been made for educational homogamy, because education 
is not only strongly related to income and status, but also to taste, values, and 
lifestyles (Kalmijn 1991a). Similar arguments can be made for horizontally 
differentiated groups, such as ethnic groups, although in this case, endogamy 
is probably more the result of preferences for cultural similarity and not so 
much the result of competition for economically attractive spouses. 

Other authors argue that social characteristics are more than simply corre- 
lates of the resources partners bring to the marriage market. Characteristics 
such as education, occupation, race, and ethnicity are also seen as badges that 
individuals wear to show others what kind of person they are. In this perspec- 
tive, spouse selection is regarded as a filter process. In the first step, people de- 
velop a network of friends, acquaintances, and possibly marriage candidates 
with whom they share some objective social characteristic. In the second step, 
people find their spouse by interacting within these homogeneous networks. 
The second step is also the phase in which psychological characteristics come 
into play, but at that time, homogamy with respect to objective social
characteristics is already ensured (Murstein 1976).


Third Parties 


A second hypothesis about why people marry within their group focuses on 
people who are not directly involved in the marriage. Because mixed
marriages may threaten the internal cohesion and homogeneity of the group, "third
parties" have an incentive to keep new generations from marrying exoga- 
mously. There are two ways in which third parties prevent exogamy: by group 
identification and by group sanctions. 


GROUP IDENTIFICATION Children are typically brought up with a sense of 
group identification. Identification either takes the form of an awareness of a
common social history, what is sometimes called a "sense of peoplehood" 
(Gordon 1964), or it can take the form of a more psychological sense of being 
different from others. The stronger such feelings of group identification, the 
more people have internalized norms of endogamy, and the more likely it is 
that they marry homogamously or endogamously. The notion of group
identification has been especially important for racial and ethnic groups, where
norms of endogamy are believed to be firmly internalized (Merton 1941). Such 
norms, however, may also apply to other kinds of groups such as social classes 
and educational groups. 

How strongly younger generations identify themselves with the group
depends to a great extent on the homogeneity of the networks in which they are
embedded. When adolescents live in neighborhoods that are homogeneous with
respect to the social and cultural characteristics of their parents, they are more 
likely to develop a sense of belonging to that group. While residential
segregation in urban areas hampers opportunities to intermarry directly, as is
discussed later, it also reduces exogamy by intensifying feelings of group
solidarity. Identification with the origin group is believed to be weakened by higher
education. Owing to the emphasis on individual achievement and universalis- 
tic principles in higher education, the college-educated may be less likely to 
identify themselves with their social and cultural roots (Hwang et al 1995). 


GROUP SANCTIONS Even if people have not internalized norms of endog- 
amy, they may still refrain from marrying exogamously because of the sanc- 
tions third parties apply. The three most important examples of parties that 
sanction intermarriage are the family, the church, and the state. Although in 
Western societies parental control over children's marriage decisions is lim- 
ited, there are still ways in which parents can interfere. They set up meetings 
with potential spouses, they play the role of matchmaker, they give advice and 
opinions about the candidates, and they may withdraw support in the early 
years of the child's marriage. Nevertheless, they do not have strong sanctions 
when children decide against their will. 

Somewhat stronger sanctions are provided by the church. Both the Catholic 
church and various Protestant denominations have denounced interfaith mar- 
riages for centuries, although the nature and strength of their disapproval have 
changed over time. Religious institutions attempt to control intermarriage in 
part because they are competing for members. Religious intermarriage entails 
the risk of losing members and may weaken church attachment in future gen- 
erations. If interfaith marriages occur anyway, it is not always in the interest of 
the church to apply sanctions because the competing church may accept the 
marriage and hence gain members. This helps explain why the Roman Catho- 
lic church, for example, has often accepted interfaith marriages on the condi- 
tion that the children be raised as Catholics. Because of competing pressures, 
however, spouses in interfaith marriages often decide not to raise their children 
in a religious fashion. If this occurs, both religious institutions stand to lose 
strength in society. 

The strongest sanctions against intermarriage have been provided by the 
state. Laws on racial intermarriage in the United States, abolished in
1967, are a well-known example (Davis 1991). When slavery was abolished,
the gradual decline in formal inequality of blacks and whites went hand-in- 
hand with a growing anxiety about the social boundary between the races, and 
this anxiety was stronger when contacts were more intimate. Interracial dating 
and marriage were condemned with great vigor, and strong social norms 
emerged against interracial contacts with possible sexual undertones, such as 
interracial dancing and swimming. The emerging doctrine of no social equal- 
ity was formalized in legislation that segregated the races in public facilities 
(Jim Crow laws) and legislation that controlled their sexual and marital con- 
tacts (antimiscegenation laws). 


Marriage Markets 


Endogamy and homogamy are not only governed by individual- and group- 
level factors, but also by structural arrangements. The chances to marry en- 
dogamously are higher the more often one meets people within the group and 
the more often one interacts with group members on a day-to-day basis. Con- 
tact opportunities are shaped by several structural arrangements. Some studies 
focus on the demographic composition of the population as a whole, other 
studies examine regional distributions of groups, and yet other studies analyze 
smaller, functional settings, such as the school and the workplace. 


THE LOGIC OF NUMBERS When interaction occurs randomly, the chance that a
woman in a certain group marries someone in her own group equals the
proportion of men who are in that group. As a result, members of a small group
will have lower chances of marrying endogamously than members of a larger 
group. The effect of group size implies that endogamy is negatively related to 
the degree of heterogeneity of a population (Blau & Schwartz 1984). To ex- 
plain this, one can think of two populations, each consisting of two groups. 
One population is heterogeneous and has 50% in each group (e.g. 100 in group 
A, 100 in group B), while the other is homogeneous and has 90% in one group
and 10% in the other (e.g. 180 in group A, 20 in group B). Both populations 
have equal numbers of males and females in each group. In the heterogeneous 
population, the number of women expected to marry within the group will be 
0.5 x 50 = 25 for A and 0.5 x 50 = 25 for B, which boils down to 50% marrying
within the group. In the homogeneous population, the number of women
expected to marry within the group will be 0.9 x 90 = 81 for A and 0.1 x 10 = 1 for
B, which boils down to 82%. This shows that in a heterogeneous population, 
endogamy is lower than in a homogeneous population, provided that marriage 
is random. 
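
The arithmetic of this example can be reproduced with a short Python sketch. The function name and input format are hypothetical, chosen only for illustration; as in the text, each group is assumed to contain equal numbers of men and women, and marriage is assumed to be random.

```python
def expected_endogamy(group_sizes):
    """Share of women expected to marry within their group under random mating.

    group_sizes: total persons per group. Each group is assumed to be half
    male and half female, as in the example in the text.
    """
    total = sum(group_sizes)
    women_total = total / 2
    endogamous_women = 0.0
    for size in group_sizes:
        women = size / 2               # women in this group
        share_of_men = size / total    # proportion of all men who are in this group
        endogamous_women += share_of_men * women
    return endogamous_women / women_total


heterogeneous = expected_endogamy([100, 100])  # 50% in each group
homogeneous = expected_endogamy([180, 20])     # 90% in one group, 10% in the other
print(f"{heterogeneous:.2f} {homogeneous:.2f}")
```

With these inputs the sketch returns the 50% and 82% derived above.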


THE GEOGRAPHY OF GROUPS The chance to encounter a member of one's 
own group does not depend on group size alone but also on the way a group is
dispersed geographically (Blau & Schwartz 1984). Groups that are
concentrated in specific regions of the country generally have more opportunity to
marry endogamously than groups that are not (Lieberson & Waters 1988). Ex- 
amples are common in the literature on ethnic groups, e.g. Asian-Americans in 
California, Jewish-Americans in New York City, or Catholics and Protestants 
separated in the southern and northern parts of the Netherlands. An additional 
reason why it is important to consider the geography of groups is that isolation 
may be correlated with group size. Smaller groups are often more isolated. 
Jewish-Americans, for example, may have partly overcome the constraints of 
their small group size through geographic concentration. They are a small 
group in a large country, but a large group in a small region. 

Although relaxing the assumption of an even geographic distribution is 
more realistic, it also leads to new problems. If one controls for geographic 
segregation—by calculating endogamy rates for specific regions, for in- 
stance—one implicitly assumes that people base their decision to live in a 
given area on factors that are independent of ingroup preferences. This is not 
always realistic. For instance, there is much regional concentration of Italian- 
Americans in the United States, but even though this can in part be attributed to 
their particular immigration history and occupational opportunities, the prefer- 
ences of Italian-Americans play a role as well (Lieberson 1980). While it is dif- 
ficult to make a precise distinction between preferences and constraints, it is 
generally true that the smaller the marriage market one studies, the more the 
structure of the market is affected by preferences and the less by constraints. 


LOCAL MARRIAGE MARKETS Unmarried people do not just wander around a
region looking for a spouse; they spend most of their life in small and func- 
tional places, such as neighborhoods, schools, workplaces, bars, and clubs. 
Such "local marriage markets" are often socially segregated, and that is why 
they are important for explaining marriage patterns. In the sociological litera- 
ture, three local markets have been considered most frequently: the school, the 
neighborhood, and the workplace. Of these three, schools are considered the 
most efficient markets because they are homogeneous with respect to age and 
heterogeneous with respect to sex. Workplaces are considered less efficient, 
but increased participation of women in the labor market and declining occu- 
pational sex segregation suggest that this may have changed (Davis 1984). Al- 
though it has not often been studied where couples meet, a French study shows 
that the settings sociologists analyze are not the most common meeting places. 
Among young French couples, fewer than 5% met in the neighborhood, fewer
than 10% met at school, and just over 10% met at work (Bozon & Heran 1989).

To clarify how local marriage markets affect homogamy, authors have 
looked at the composition of these markets with respect to social
characteristics. What distinguishes the neighborhood from the school and the
workplace is that it is homogeneous with respect to factors such as ethnicity,
race, religion, and family background, i.e. characteristics transmitted by
parents (Lieberson 1980). Schools are less homogeneous in ascribed
characteristics, although there are exceptions, e.g. Catholic colleges and black
colleges. At the
same time, schools are not necessarily homogeneous with respect to educa- 
tional attainment. Differences in ultimate educational attainment are larger in 
high schools, for example, than in universities, simply because the educational 
system works like a funnel, particularly in the United States (Mare 1991). In 
general, however, it is expected that colleges promote educational homogamy 
more than neighborhoods do, while neighborhoods promote ethnic endogamy 
and homogamy of family background more than schools. Whether workplaces 
encourage homogamy depends heavily on the type of work, but on average,
they probably do not encourage socioeconomic homogamy as much as schools. 


EMPIRICAL WORK ON INTERMARRIAGE AND 
HOMOGAMY 


Empirical work has addressed four questions: (a) To what extent are groups 
endogamous or homogamous, and how do groups differ in these respects? (b) 
How have endogamy and homogamy changed over time? (c) Which factors 
are related to endogamy and homogamy, and in particular, what is the role of 
gender, education, and geographic regions and local marriage markets? (d) 
How do various dimensions of partner choice coincide? Before I summarize 
the main findings, I discuss how researchers have tackled these issues method- 
ologically. 


Measures and Models 


Intermarriage can be calculated for the stock of marriages at a given point in 
time (prevalence measures) or for people who marry in a given period of time
(incidence measures). Incidence measures are generally preferable, in particu- 
lar if one analyzes trends. If the stock of marriages is used, one can analyze 
characteristics at the time of survey or characteristics at the time of marriage. 
The latter measures are more suitable than the former because some character- 
istics change after marriage. Because partners may become more alike during 
marriage—they may switch faith, for example, or influence each other's occu- 
pational career—current measures of homogamy tend to be biased upwardly. 
To describe intermarriage, various measures have been used. To explain these, 
it is helpful to consider the following marriage table. 


                                FEMALES
                        Group A     Group B     Total married
MALES   Group A         $C_{AA}$    $C_{AB}$    $M_A$
        Group B         $C_{BA}$    $C_{BB}$    $M_B$
        Total married   $F_A$       $F_B$       $N$


MEASURES The most general measure is the percentage of couples
intermarrying: $(C_{BA} + C_{AB})/N$. When calculating group-specific measures, it makes a
difference if one considers couples or individuals. The percentage of A-type
couples intermarrying is $(C_{BA} + C_{AB})/(C_{BA} + C_{AB} + C_{AA})$, while the percentage
of A-type married persons intermarrying is $C_{AB}/M_A$ for males and $C_{BA}/F_A$ for
females. While percentages are simple and informative measures to describe
intermarriage, they provide little information about the strength of endogamy 
because they lack a reference point. If 40% of a group marries endogamously, 
is this evidence for a preference to marry within rather than outside the group? 
Percentages are also less useful for comparing groups because when selection 
is random, small groups are less likely to marry within their group than large 
groups. 
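
As a minimal sketch, the percentage measures can be computed from a two-group marriage table as follows. The counts are invented for illustration and the variable names are assumptions; the first letter refers to the husband's group and the second to the wife's group, following the marriage table.

```python
# Hypothetical counts, invented for illustration only.
# First letter: husband's group; second letter: wife's group.
c_aa, c_ab = 60, 20
c_ba, c_bb = 10, 110

m_a = c_aa + c_ab                  # married A-type males
f_a = c_aa + c_ba                  # married A-type females
n = c_aa + c_ab + c_ba + c_bb      # all couples

# Percentage of all couples that are mixed.
pct_couples_mixed = 100 * (c_ba + c_ab) / n
# Percentage of couples involving an A-type spouse that are mixed.
pct_a_couples_mixed = 100 * (c_ba + c_ab) / (c_ba + c_ab + c_aa)
# Percentage of A-type married males and females who married out.
pct_a_males_out = 100 * c_ab / m_a
pct_a_females_out = 100 * c_ba / f_a

print(pct_couples_mixed, pct_a_males_out)
```

Note that the couple-based and person-based figures for the same group differ, which is why the choice between couples and individuals matters.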

These problems are overcome by a more recent measure, the odds ratio. The 
odds ratio is defined as the odds that an A-type male marries an A-type female 
(rather than a B-type female), divided by the odds that a B-type male marries 
an A-type female, i.e. $(C_{AA}/C_{AB})/(C_{BA}/C_{BB})$. The odds ratio for women is
equivalent, i.e. $(C_{AA}/C_{BA})/(C_{AB}/C_{BB})$. If there are more than two groups in
the marriage table, one can calculate odds ratios for each group separately. If
$C_{AX}$ and $C_{XA}$ are marriages of A-type males and females with all other groups,
and $C_{XX}$ are marriages that do not involve A-type males or females, the odds
ratio can be defined as $(C_{AA}/C_{AX})/(C_{XA}/C_{XX})$. Odds ratios have two
important advantages. First, they provide a reference point: Odds ratios greater than
one indicate that there is more endogamy than one would expect, and the larger 
the ratio, the greater the degree of endogamy. Second, odds ratios are useful for 
comparing endogamy across groups because they are independent of the rela- 
tive sizes of the groups in the marriage table. 
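
The two properties of the odds ratio can be illustrated with a small Python sketch. The counts are invented and the function names are hypothetical; the second table simply contains ten times as many B-type husbands with an unchanged selection pattern.

```python
def odds_ratio_men(c_aa, c_ab, c_ba, c_bb):
    """Odds that an A-type male marries an A-type rather than a B-type female,
    divided by the same odds for a B-type male."""
    return (c_aa / c_ab) / (c_ba / c_bb)


def odds_ratio_women(c_aa, c_ab, c_ba, c_bb):
    """The equivalent odds ratio computed from the women's point of view."""
    return (c_aa / c_ba) / (c_ab / c_bb)


# Hypothetical table: (C_AA, C_AB, C_BA, C_BB).
base = odds_ratio_men(60, 20, 10, 110)
women = odds_ratio_women(60, 20, 10, 110)

# Same selection pattern, but ten times as many B-type husbands.
scaled = odds_ratio_men(60, 20, 100, 1100)

print(round(base, 6), round(women, 6), round(scaled, 6))
```

The ratio is far above one, indicating endogamy, and it is unchanged when the relative size of group B changes, whereas the overall percentage of mixed couples would differ between the two tables.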

A disadvantage of the measures discussed above is that they are based on 
the married or marrying population. A measure of intermarriage that takes into 
account that not everyone marries is the intermarriage index Z, which is based 
on so-called harmonic mean models developed by Schoen (1988). If $M^{p}$ and $F^{p}$
refer to the total number of males and females in the respective groups
(married and unmarried), Z is defined as
$(C_{AB}/M^{p}_{A} + C_{BA}/F^{p}_{A} + C_{BA}/M^{p}_{B} + C_{AB}/F^{p}_{B}) / (M_A/M^{p}_{A} + M_B/M^{p}_{B} + F_A/F^{p}_{A} + F_B/F^{p}_{B})$.
This intermarriage index ranges
from zero for minimum intermarriage to one for maximum intermarriage. 
When selection is random, the index takes the value of 0.5 (Schoen 1988). 
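
A minimal sketch of the index, under one reading of the definition above: the numerator sums the intermarriage rates of the four sex-by-group categories and the denominator sums their overall marriage rates. The function name, dictionary keys, and counts are assumptions made for illustration and are not taken from Schoen (1988).

```python
def intermarriage_z(c_ab, c_ba, married, population):
    """Intermarriage index Z for two groups, A and B.

    c_ab: marriages of A-type males with B-type females
    c_ba: marriages of B-type males with A-type females
    married: married persons, keyed by "M_A", "M_B", "F_A", "F_B"
    population: all persons (married and unmarried), same keys
    """
    # Numerator: intermarriage rates of the four sex-by-group categories.
    numerator = (c_ab / population["M_A"] + c_ba / population["F_A"]
                 + c_ba / population["M_B"] + c_ab / population["F_B"])
    # Denominator: overall marriage rates of the same four categories.
    denominator = sum(married[key] / population[key] for key in married)
    return numerator / denominator


# Invented example: 90 A-type and 10 B-type persons of each sex, all married.
population = {"M_A": 90, "M_B": 10, "F_A": 90, "F_B": 10}
married = dict(population)

# Random selection implies 9 marriages of each mixed type (0.1 x 90 and 0.9 x 10).
z_random = intermarriage_z(c_ab=9, c_ba=9, married=married, population=population)

# No mixed marriages at all.
z_none = intermarriage_z(c_ab=0, c_ba=0, married=married, population=population)

print(round(z_random, 6), z_none)
```

Under this reading the sketch reproduces the reference values given in the text: zero without intermarriage and 0.5 under random selection, even though the two groups differ greatly in size.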

Percentages, odds ratios, and the intermarriage index can be applied to both 
ordered and nonordered characteristics. For ordered characteristics, another 
common measure is the Pearsonian correlation between spouses’ traits. A 
positive correlation means that high-status men marry higher-status women
than low-status men; it does not necessarily mean that people marry within
their group. The correlation between the ages of husband and wife, for exam- 
ple, is strongly positive, even though most men marry somewhat younger 
women. 
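
The age example can be made concrete with a short Python sketch. The ages are invented, and the helper function is a plain implementation of the Pearson correlation written for illustration.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation between two equally long lists of numbers."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y)


# Invented ages: every wife is exactly two years younger than her husband.
husband_age = [24, 27, 30, 35, 41]
wife_age = [22, 25, 28, 33, 39]

r = pearson(husband_age, wife_age)
same_age_couples = sum(h == w for h, w in zip(husband_age, wife_age))

print(round(r, 6), same_age_couples)
```

The correlation is at its maximum although not a single couple is homogamous in the strict sense of sharing the same age.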


MODELS In addition to measures of intermarriage, loglinear models have been
used to describe patterns of marriage selection. These models assume that the 
expected counts in the marriage table are a multiplicative function of sample 
size, the number of males in a group, the number of females in a group, and an 
interaction parameter, which measures marriage selection independent of the 
marginal row and column distributions. Many ways to model the interaction 
parameter exist, but most authors present parameters for the tendency to marry 
within the group (endogamy) and parameters for the tendency to avoid inter- 
marrying when controlling for the tendency to marry within the group (inter- 
marriage). The latter parameters are often equivalent to odds ratios and have 
been described by the metaphors of distances or boundaries between groups 
(Mare 1991; Kalmijn 1991b). When characteristics are ordered, loglinear 
models also provide single measures of association that are comparable to
correlations but independent of the marginal distributions, i.e. uniform
association models (Hout 1982). When characteristics are not ordered, special types
of loglinear models exist that provide measures of the distances between
groups as revealed by the marriage frequencies in the table, i.e.
log-multiplicative models (Johnson 1980; Kalmijn 1993a).
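
The multiplicative structure described above can be sketched for the simplest case, a two-group table. This is a simplified illustration with invented counts, not the models estimated in the studies cited: it compares observed counts with the counts expected when the interaction parameter is absent, and it summarizes the interaction by the log odds ratio.

```python
import math

# Invented counts: rows are the husband's group, columns the wife's group.
observed = [[60, 20],
            [10, 110]]

n = sum(sum(row) for row in observed)
row_totals = [sum(row) for row in observed]         # males per group
col_totals = [sum(col) for col in zip(*observed)]   # females per group

# Without interaction: expected count = N x (male share) x (female share).
expected = [[n * (r / n) * (c / n) for c in col_totals] for r in row_totals]

# Departure of each observed cell from the no-interaction model.
interaction = [[observed[i][j] / expected[i][j] for j in range(2)]
               for i in range(2)]

# In a 2 x 2 table the association reduces to a single log odds ratio.
log_odds_ratio = math.log((observed[0][0] * observed[1][1])
                          / (observed[0][1] * observed[1][0]))

print(expected[0][0], round(interaction[0][0], 3), round(log_odds_ratio, 3))
```

Ratios above one on the diagonal, and a positive log odds ratio, indicate that marriages within the group are more frequent than the marginal distributions alone would produce.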


Patterns of Intermarriage and Homogamy 


In describing patterns of intermarriage and homogamy, researchers have
addressed three questions: (a) To what extent do subgroups marry
endogamously? (b) If subgroups marry out, with what groups are they most likely to
intermarry? and (c) How do subgroups compare in their degree of endogamy? 


RACE/ETHNICITY Most American studies of ethnic intermarriage analyze 
data from the perspective of the minority group and focus on specific types of 
subgroups. Several decades ago, interest largely focused on European immi- 
grant subgroups and their children. More recently, new immigrant subgroups 
such as Asian- and Hispanic-Americans are being studied, though there is a
resurgent interest in the descendants of the older immigrants, sometimes referred
to as white ethnic groups. Intermarriage of blacks has always been studied fre- 
quently. Research on intermarriage of American Indians, in contrast, is scarce. 
Recent national estimates of the percentage of persons who are married en- 
dogamously vary around 95% for blacks (Sweet & Bumpass 1987), 75% for 
Asian subgroups (Lee & Yamanaka 1990), 65% for Hispanic subgroups (U.S. 
Bureau of the Census 1985), 45% for American Indians (Snipp 1989), and
25% for (unmixed) European subgroups (Alba & Golden 1986). Although
these percentages are high, they do not indicate whether groups are endoga- 
mous. Loglinear models and harmonic mean analyses are more informative in 
this respect and show that virtually all ethnic subgroups marry within their 
group more often than can be expected under random mating (Jiobu 1988; 
Schoen & Thomas 1989; Sandefur & McKinnell 1986; Alba & Golden 1986). 

A low degree of endogamy does not necessarily imply integration; this also 
depends on patterns of outmarriage. When Hispanic subgroups marry
exogamously, for example, they often marry with other Hispanic subgroups and
hence keep a distance from the non-Hispanic white majority (Gurak & Fitz- 
patrick 1982). Asian-Americans who marry out, in contrast, rarely marry other 
Asian subgroups and instead marry with whites (Lee & Yamanaka 1990). In- 
termarriage between European subgroups also reveals meaningful patterns. In 
the first half of this century, marriage selection was characterized by a large 
distance between Western and Northern Europeans on the one hand and 
Southern, Central, and Eastern Europeans on the other (Pagnini & Morgan 
1990; Kalmijn 1993a). This finding has been interpreted as evidence of a 
boundary between the “old” and “new” European immigrants to the United 
States. 

How do subgroups compare in their degree of endogamy? Unfortunately, 
most studies focus on one type of subgroup at a time without analyzing other 
types. A broader focus is provided by Lieberson and Waters, who present a list
of odds ratios for more than 20 groups (1988). European subgroups and 
American Indians appear to have the lowest rates of endogamy, Hispanic and 
Asian subgroups have intermediate levels of endogamy, and blacks have the 
highest rates. There are also differences among European subgroups—for in- 
stance, endogamy is lower for “old” than for “new” European groups—but 
these are small when considering the range in the list. The main conclusion of 
Lieberson and Waters's analysis is that groups who are more recent to the host
society have higher degrees of closure, a regularity that fits well into assimila- 
tion theories. Blacks are the prime exception to this pattern. A similar relation- 
ship between the newness of a group and its level of endogamy is found in 
Australia (Jones & Luijkx 1996). 


RELIGION Religious intermarriage has primarily been studied in religiously 
heterogeneous societies. Some authors use current religious affiliation to 
measure intermarriage, while others use the religion in which spouses were 
raised. Endogamy is higher when current affiliation is used, because spouses 
often switch faith or lose their religion after entering a mixed marriage (Glenn 
1982). Estimates for the United States that use parental religion show that in 
the late 1970s, 62% of Catholics were married within their group, 84% of
Protestants were married endogamously, and 80% of Jews were married
endogamously (Glenn 1982). Loglinear analyses further show that both Catholics and
Protestants have a tendency to marry within rather than outside their group; 
this is found in the United States (Kalmijn 1991b), the Netherlands (Hendrickx
et al 1991), Germany (Hendrickx et al 1994), Australia (Hayes 1991), and 
Switzerland (Schoen & Thomas 1990). Comparisons between countries using 
odds ratios show that endogamy is strongest in Ireland and Northern Ireland, as 
one would expect, and that Catholic endogamy is stronger in the United States 
than in most European countries (Klein & Wunder 1996). 

Which religious groups are most endogamous? Loglinear analyses in the 
United States indicate that Catholics are somewhat more closed than Protes- 
tants (Johnson 1980). Loglinear analyses have not included Jews, but it is safe 
to say that American Jews are more endogamous than Catholics because their 
intermarriage percentage is comparable to that of Catholics, while they are a 
much smaller group. Detailed loglinear analyses of Protestant denominations 
show that more-conservative Protestant denominations, such as the Re- 
reformed in the Netherlands (Hendrickx et al 1994) and Baptists in the United 
States and Australia (Johnson 1980; Hayes 1991) are more endogamous than 
liberal denominations. These findings are consistent with the notion of third- 
party control: Denominations and religions that are more traditional in relig- 
ious doctrine and have higher degrees of church involvement among their 
members also have the highest degree of endogamy. 

When analyzing marriages between religious groups, authors have used the 
concept of social distance and have developed loglinear models providing 
such measures. In the United States, such analyses point to the following order 
of groups: Baptists, Methodists, liberal Protestants, Lutherans, and Catholics 
(Johnson 1980). These distances have been interpreted in terms of ritual and 
regional dimensions. Groups on the left side of the continuum are more demo- 
cratic in their organization, have less detailed prescriptions of ritual, and em- 
phasize more spontaneous forms of worshiping than groups on the right side of 
the continuum. Regional patterns play a role here as well. Baptists and Meth- 
odists are concentrated in the South, for example, while Catholics are concen- 
trated in the Northeast. 


SOCIOECONOMIC STATUS The literature on socioeconomic homogamy can be
distinguished into studies of ascribed status and studies of achieved status. As- 
cribed status positions are measured by the occupational class of the father and 
the father-in-law. Achieved status positions are measured by education and oc- 
cupation. Education is used more often because it is a convenient status indica- 
tor of women and changes little after marriage. In most countries, educational 
homogamy is quite strong (about 0.55), occupational homogamy is somewhat 
weaker (about 0.40), while the correlation between husbands' and wives' class 
origins is the weakest, about 0.30 (Kalmijn 1991a; Uunk 1996).

Loglinear analyses provide additional insights into the association between
partners’ status positions. Such analyses first show that people marry within 
rather than outside socioeconomic groups, although some groups are more 
closed than others. Groups at the top and the bottom of the educational hierar- 
chy are more closed than groups in the middle (Uunk et al 1996; Hendrickx 
1994). These tendencies may be due to the role of opportunity: If people at the 
bottom prefer to marry out, they can only choose higher groups and if people 
at the top prefer to marry out, they can only choose lower groups. People 
from a farm background have an exceptionally high rate of endogamy, a
finding that can probably be explained in terms of the social and geographic
isolation of the rural population (Kalmijn 1991a; Uunk et al 1996; Jones & Davis
1988). 

In addition to a tendency to marry within the group, there is a tendency for
marriage to become less common the farther apart the two status positions are.
Some status boundaries are harder to cross than others, however. For educa- 
tion, the strongest boundary is that between college graduates and lesser- 
educated persons (Mare 1991; Kalmijn 1991a). A common interpretation of 
this finding is that colleges function as local marriage markets that are physi- 
cally separated from settings in which lesser-educated persons are involved. 
Patterns of occupational homogamy, like patterns of intergenerational occupa- 
tional mobility, are dominated by the line that divides blue-collar and white- 
collar occupations (Hout 1982; Hayes 1993). More detailed analyses of occu- 
pational homogamy have shown that there is more homogamy with respect to 
the cultural status of occupations than with respect to the economic status of 
occupations (Kalmijn 1994). This suggests that preferences for cultural simi- 
larity are stronger than preferences for economically attractive spouses. 


Trends in Intermarriage and Homogamy 


Trends have been analyzed in three ways. (a) Some researchers compare mar- 
riage or birth cohorts at a single point in time. Such synthetic cohort studies are 
potentially biased because older cohorts have been married longer than 
younger cohorts. Cohorts differ in their rate of attrition, and this attrition may 
be selective because the likelihood of divorce is inversely related to homogamy.
(b) Other studies compare the stock of intact marriages at different points
in time. This design has the disadvantage of containing much overlap of mar- 
riages in the points of comparison, which leads to an underestimation of linear 
trends. (c) Others, finally, compare recently formed marriages in different pe- 
riods, either through annual marriage licenses or through comparisons of new- 
lyweds in multiple surveys or censuses. This method is most suitable for ana- 
lyzing trends because it gives a picture of the changing incidence of intermar- 
riage. 


RACE/ETHNICITY Analyses of marriage records generally reveal an increase
in intermarriage of new ethnic groups in the last decades; this applies to His- 
panic intermarriage in New York City (Gilbertson et al 1996), to Asian inter- 
marriage in Hawaii (Schoen & Thomas 1989), and to Asian intermarriage of 
males—not females—in New York City (Sung 1990). The trend in black- 
white intermarriage has been documented for a longer time period and in more 
states. Annual marriage records in 33 states reveal that black-white intermar- 
riage has increased significantly in both northern and southern states since the 
legal ban on intermarriage was lifted, although it remains exceptionally low 
(Kalmijn 1993b). Comparisons of the 1980 and 1990 American censuses con- 
firm this conclusion (Qian 1997). Ethnic characteristics that are not included in 
marriage licenses have primarily been analyzed through synthetic cohort 
analyses. Such studies reveal growing outmarriage across birth cohorts for 
European-American groups (Alba & Golden 1986; Lieberson & Waters 1988), 
for American Indians (Eschbach 1995), and for ethnic groups in Australia
(Jones & Luijkx 1996). 

The decline in ethnic endogamy has typically been interpreted from an as- 
similation perspective: Through generational replacement, national origin 
groups gradually integrate in the host society. Consistent with this perspective, 
most analyses find that the children of immigrants marry out more often than 
the immigrants themselves (Gilbertson et al 1996; Lee & Yamanaka 1990; 
Alba 1976). Because trends also occur within generations of immigrants, indi- 
vidual assimilation to the host society is not a sufficient explanation (Gilbert- 
son et al 1996; Sung 1990). An additional interpretation is that assimilation is a 
process at the macro level: When more and more members of an ethnic group 
are of the second or third generation—when an ethnic group becomes 
“older”—all generations find it easier to adapt to the host society. Another in- 
terpretation is more general in nature and points to the weakening influence of 
third parties in marriage choice and the declining importance of ascription as a 
basis of evaluating other people. 


RELIGION In the United States, trends in religious homogamy have primarily 
been assessed through surveys, largely because few places report religion on 
their marriage licenses and because the census is not allowed to ask questions 
on religious affiliation. By analyzing national surveys conducted between 
1955 and 1989 and using a design that separates the effects of period and dura- 
tion of marriage, Kalmijn (1991b) shows that intermarriage between Catholics 
and Protestants has increased in a linear fashion between 1920 and 1980. 
Trends in intermarriage between Jews and non-Jews have been documented by 
comparing subsequent surveys (Lazerwitz 1995) and by comparing marriage 
cohorts within a single survey (Kosmin et al 1991). Both types of analyses 
show that Jewish-Gentile intermarriage has increased considerably over the 
last decades. Conservative Christian groups are an exception to these trends;
their endogamy rates appear to have remained stable over time (McCutcheon 1988).
In many other Western societies, questions on religion are included in mar- 
riage records so that long-term trends can be documented there more easily. A 
loglinear trend analysis of annual Dutch marriage records since the 1930s 
shows that religious endogamy of Catholics and the conservative Re- 
Reformed Protestants has declined (Hendrickx et al 1991). The more liberal 
Dutch Reformed Protestants experienced no decline, but they had low levels of 
endogamy to begin with. Marriage records in Switzerland (Schoen & Thomas 
1990) and Germany (Hendrickx et al 1994) also reveal a decline in the level of 
endogamy of Catholics and Protestants. That the boundaries between religious 
groups in Europe and the United States have weakened during the twentieth 
century is consistent with the notion of declining third-party control and
matches long-term processes such as secularization and depillarization. 


SOCIOECONOMIC STATUS Trends in socioeconomic homogamy are most fre- 
quently studied by analyzing class background and education. In most indus- 
trialized countries, there has been a decline in the importance of social back- 
ground for marriage choice. This has been found for the United States (Kalmijn
1991a), the Netherlands (Uunk 1996), Hungary (Uunk et al 1996), and
France (Forsé & Chauvel 1995). The most common interpretation of this trend 
lies in the role of third parties and opportunity. Young adults have become in- 
creasingly independent of parents so that parents have less direct or indirect 
control over the choices their children make. People also spend more time in 
school settings, which are more heterogeneous with respect to social class 
background than the parental neighborhood. 

Trends in educational homogamy do not point in one direction. A loglinear
analysis of 18 postwar industrial nations by Ultee & Luijkx (1990) reveals that 
five countries experienced a decline in educational homogamy, three countries 
experienced an increase, while the remaining ten revealed no meaningful 
trend. Country-specific loglinear analyses also reveal a mixed pattern, although
they do not reveal a decline in homogamy: (a) a strong increase in the
United States (Mare 1991; Kalmijn 1991a), Hungary (Uunk et al 1996), and
Germany (Blossfeld & Timm 1997); (b) a slight increase in the Netherlands 
(Hendrickx 1994); and (c) stability in Australia (Jones 1987) and France 
(Forsé & Chauvel 1995). 

Several hypotheses have been suggested to explain these trends. Some 
authors argue that opportunities for making a match on education have in- 
creased. People marry later and spend more time in school, but the time inter- 
val between leaving school and marriage has narrowed. As a result, it is now 
more likely that unmarried people, especially the college educated, meet their 
spouse in school (Mare 1991). Others point to the role of preferences. Education
has become an increasingly important proxy for both cultural taste and socioeconomic
success, and competition among men for socioeconomic resources
in women may have increased (Kalmijn 1991a; Schoen & Wooldredge
1989). There are also hypotheses predicting a decrease in educational homog- 
amy. Some authors argue that marriage choice has become increasingly based 
on emotional or affective considerations. Because romantic considerations of- 
ten overrule status concerns, one would expect a decline in all forms of status 
homogamy (Ultee & Luijkx 1990). 

An attempt to reconcile these hypotheses is made by Smits et al (1998), who 
argue that educational homogamy will initially increase with levels of indus- 
trialization because in this phase, education becomes the dominant criterion 
for socioeconomic success and cultural norms and values. Romantic consid- 
erations and individualism gain importance in later stages of the industrializa- 
tion process when high standards of living are guaranteed for everyone. As a 
result, educational homogamy will first increase with levels of industrializa- 
tion, but will eventually decrease. A comparison of 64 countries provides indi- 
rect support for this claim: The relationship between educational homogamy 
and the level of industrialization follows an inverted “U.” 


Variations in Intermarriage and Homogamy 


In addition to describing patterns and trends, authors have analyzed variations in
intermarriage and have examined what factors contribute to outmarriage. Recurring
themes in the literature are differences by sex, by education, and by region.
Although these factors are generally studied in an exploratory fashion,
they also give us clues about the causes of endogamy. 


SEX DIFFERENCES Sex differences have most often been studied in the litera- 
ture on racial and ethnic intermarriage. Studies on black-white intermarriage 
in the United States consistently show that black men marry whites more often 
than black women do (Kalmijn 1993b; Schoen & Wooldredge 1989). A traditional
interpretation of this finding is that minority men are able to compensate
for their lower “ethnic prestige” by offering white women a high occupational
status or income. Although in principle one could reverse the exchange—high- 
status minority women could marry white men of lower status—under condi- 
tions of traditional sex-roles, this type of marriage is believed to be uncommon 
because the status of the family is largely dependent on the status of the hus- 
band. 

Although the interpretation is plausible, findings for other ethnic groups 
provide a counterpoint. Asian-American women, for example, and in particu- 
lar Japanese-American women, marry whites more often than their male coun- 
terparts (Sung 1990). A speculative interpretation of this exception is that 
Asian-American women are attractive marriage candidates for white men because
of their physical appearance and presumed acceptance of more traditional
power relationships in marriage. A more plausible interpretation lies in
the role of opportunity: the presence of American soldiers in Japan and Korea. 
A recent analysis shows that excluding such war brides leads to a substantial 
reduction in the sex differential in Asian-American intermarriage (JJ Jacobs & 
T Labov, unpublished manuscript). 

Sex differences have also been studied in the analysis of socioeconomic ho- 
mogamy. A common finding is that highly educated men and men in profes- 
sional and technical occupations marry down more often than up (Mare 1991; 
Kalmijn 1994). Laymen generally interpret downmarrying as evidence of a re- 
luctance on the part of men to marry high-status women, but most of the asym- 
metry is due to differences in the composition of men's and women's charac- 
teristics. On average, women have traditionally been less educated and less of- 
ten have had high-status occupations than men. Once such differences are 
taken into account through loglinear analyses, researchers generally find little 
evidence of asymmetry (Mare 1991). Similar conclusions apply to trends: 
Educational downmarrying among men has become less common, but this is 
largely due to the increased educational attainment of women (Mare 1991). 


EDUCATIONAL EFFECTS Another frequently examined factor in intermarriage
is education. Many studies have found that more highly-educated members of 
ethnic or racial minority groups marry exogamously more often than their 
lesser-educated peers. This applies to white ethnic groups (Lieberson & Wa- 
ters 1988), blacks (Kalmijn 1993b; Schoen & Wooldredge 1989), and Ameri- 
can Indians (Sandefur & McKinnell 1986). Less consistent evidence is found 
for outmarriage of Asian-Americans (Hwang et al 1995; Wong 1989; Schoen 
& Thomas 1989). 

Educational effects have been interpreted in terms of both opportunity and 
preference. The former interpretation states that better educated minority 
members are more often exposed to settings such as colleges and high-status 
occupations where they form a relatively smaller group than in the population 
at large. Another interpretation states that more highly educated persons—of 
both majority and minority groups—have a more individualistic attitude, are 
less attached to their family and community of origin, and have a more univer- 
salistic view on life than lesser-educated persons. As a result, they would find 
ascribed characteristics less relevant in deciding whom to marry. 


DIFFERENCES BETWEEN REGIONS AND SETTINGS Virtually all studies find 
large regional differences in intermarriage. Asian-Americans marry out less
often in California, where they are concentrated, than in the rest of the United
States (Wong 1989), American Indians are more endogamous in so-called Indian
States (Sandefur & McKinnell 1986), and blacks marry more endogamously
in states where the percentage of blacks in the population is larger, a relationship
that is observed both in and outside the South (Kalmijn 1993b).
Such patterns undoubtedly point to the role of opportunity: The smaller the 
group, the more difficult it is to marry within the group. Hypotheses about 
group size and its corollary, heterogeneity, have also been examined by analyzing
(Standard) Metropolitan Statistical Areas [(S)MSAs] or states through
correlational analyses. These studies find that relative group size is negatively 
correlated with black outmarriage and ethnic outmarriage; similarly, racial, 
ethnic, and occupational heterogeneity have positive effects on the respective 
types of intermarriage (Blau et al 1982; Hwang et al 1994). 
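
The opportunity argument can be made concrete with a small calculation. The sketch below is illustrative only and is not taken from the studies cited. It assumes a closed marriage market with balanced sex ratios in every group, and the group share and observed rate are hypothetical. Under random mating, the chance of marrying within the group equals the group's share of the market, so smaller groups are expected to marry out more often even when preferences are identical.

```python
def expected_endogamy_rate(group_share):
    """In-marriage rate expected under random mating.

    Assumes a closed marriage market with balanced sex ratios, so the
    chance of drawing an in-group partner equals the group's share.
    """
    if not 0.0 <= group_share <= 1.0:
        raise ValueError("group_share must lie between 0 and 1")
    return group_share


def endogamy_relative_to_chance(observed_rate, group_share):
    """Ratio of the observed in-marriage rate to the random-mating benchmark."""
    expected = expected_endogamy_rate(group_share)
    if expected == 0.0:
        raise ValueError("benchmark is undefined for an empty group")
    return observed_rate / expected


# Hypothetical values: a group that makes up 5% of the marriage market
# and in which 60% of members marry within the group.
share, observed = 0.05, 0.60
print(f"expected under random mating: {expected_endogamy_rate(share):.2f}")
print(f"observed relative to chance: {endogamy_relative_to_chance(observed, share):.1f}")
```

A ratio well above one indicates more in-marriage than opportunity alone would produce, but the ratio itself still depends on group size, which is one reason the literature also uses loglinear models.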

The role of opportunity has also been analyzed by considering local mar- 
riage markets. In a classic study, Ramsey (1966) analyzed marriage licenses in 
a Norwegian city and found that husbands and wives lived close to each other 
before marriage, and in fact closer than one would expect under conditions of 
random mating in a city. While this confirms that neighborhoods are marriage 
markets, Ramsey also showed that people who lived close to one another be- 
fore marriage did not marry more homogamously with respect to occupation 
than people who lived far apart. Hence, it appeared that the neighborhood did 
not by itself promote occupational homogamy. Ramsey's analysis was criti- 
cized on methodological grounds by Peach (1974), but later, more elaborate 
analyses of newlyweds in a New Zealand city by Morgan (1981) found no 
clear link between spatial and status proximity either. Analyses of ethnic en- 
dogamy yield a more promising conclusion. Anderson & Saenz (1994), for ex- 
ample, find that MSAs in which Mexican-Americans are residentially segre- 
gated from non-Hispanic whites have lower degrees of Mexican-American 
outmarriage, even when other group characteristics such as mean educational 
level are controlled for. 

The school is another local marriage market, but its role in promoting edu- 
cational homogamy has only been studied indirectly. Mare (1991) found that 
people who marry closer to finishing school, or while in school, marry people 
who are more similar in education than people who marry long after finishing 
school. This relationship was found to be present only for higher levels of education.
A similar relationship is found in Germany, although there, a decline in
educational homogamy is only observed when comparing couples who mar- 
ried a few years after leaving school with couples who married much later 
(Blossfeld & Timm 1997). These findings provide indirect evidence that 
schools function as marriage markets that favor educational homogamy. 
Schools, and in particular colleges, are educationally homogeneous, while the 
settings people face when they search for a partner at a later stage, such as work 
settings and public places, tend to be less homogeneous. Further evidence on 
the role of schools is provided by Uunk & Kalmijn (1996), who show that the 
college-educated in the Netherlands have a tendency to marry someone who 
has the same college major. In the Netherlands, fields of study can be regarded
as local marriage markets within the university because students choose a ma- 
jor when they first enroll and do not follow courses in other fields. 


Multiple Dimensions 


Most early studies analyzed a single sociological characteristic at a time or 
analyzed several characteristics one-by-one. Since partners choose each other 
on the basis of multiple characteristics, it is important to analyze more than one 
factor in marriage choice. In the last decades, several such multidimensional 
analyses have been done, although most are limited to two dimensions. Re- 
search on multiple dimensions has been guided by two hypotheses: the by- 
product hypothesis and the exchange hypothesis. 


BY-PRODUCT HYPOTHESES There is a considerable overlap between social 
groups in society. Ethnic groups, for example, differ in educational level, re- 
ligion and ethnicity often coincide, and education and social background are 
correlated. Because the various social dimensions on which individuals select 
one another are correlated, and because people are believed to take all these di- 
mensions into account when choosing a spouse, the question arises if and to 
what extent homogamy in one group dimension is the by-product of selection
in another group dimension. 

An early attempt to examine this issue empirically was done by Warren 
(1966; see also Blau & Duncan 1967:354–59), who showed that the correlation
between the spouses' fathers' occupations is reduced substantially when
controlling for spouses' education. This result led Warren to conclude that so- 
cial class homogamy is largely a by-product of educational homogamy. More 
recent analyses confirm this and show in addition that educational homogamy 
is in part a by-product of matching of social origins (Kalmijn 1991a; Uunk
1996). Hence, both forms of homogamy appear to be weaker when a multidi- 
mensional analysis is used, although even then, educational homogamy re- 
mains stronger than homogamy of social origins. 
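
The logic of the by-product test can be illustrated with a first-order partial correlation. This is a simplified sketch, not Warren's actual procedure: it controls for a single education variable, whereas the analyses cited control for the education of both spouses. The value 0.30 for the correlation between spouses' class origins is the approximate figure reported earlier in this review; the two correlations with education are hypothetical.

```python
import math


def partial_correlation(r_xy, r_xz, r_yz):
    """Correlation between x and y after removing the linear effect of z."""
    denominator = math.sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))
    if denominator == 0.0:
        raise ValueError("z is perfectly correlated with x or y")
    return (r_xy - r_xz * r_yz) / denominator


# x = husband's class origin, y = wife's class origin, z = education.
r_origins = 0.30           # approximate value reported in this review
r_husband_edu = 0.45       # hypothetical
r_wife_edu = 0.45          # hypothetical

net = partial_correlation(r_origins, r_husband_edu, r_wife_edu)
print(f"zero-order correlation of class origins: {r_origins:.2f}")
print(f"partial correlation net of education:   {net:.2f}")
```

With these inputs the association between class origins shrinks considerably once education is held constant, which is the pattern the by-product hypothesis predicts.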

The by-product hypothesis has also been a theme in the study of ethnic in- 
termarriage. A classic study of New Haven in the first half of this century by 
Kennedy (1944) showed that intermarriage is more common between groups 
who have the same faith, such as between Italians and Poles on the one hand 
(both largely Catholic) and between Hungarians and Russians on the other 
(both largely Jewish). Kennedy used the now classic term “triple melting pot”
to describe this pattern. Kennedy's triple melting pot confirms the by-product 
hypothesis because it reveals that marriage boundaries between certain ethnic 
groups are in part the result of differences with respect to religion. Recent, 
more sophisticated loglinear analyses confirm that there are strong marriage 
boundaries between ethnic groups who have a dissimilar faith (Alba & Golden 
1986), but no studies have simultaneously analyzed individual ethnic and religious
characteristics of husbands and wives.


EXCHANGE HYPOTHESES A second theme in multidimensional analyses is the
question of whether people trade characteristics when choosing a spouse. Sev- 
eral examples of exchange have been considered, but the most debated case 
was introduced by Davis (1941) and Merton (1941), who argued that members 
of ethnic groups whose prestige in society is low would have better chances of
marrying outside their group if they offered a high socioeconomic status in re- 
turn. 

The Davis-Merton hypothesis is most frequently examined in research on 
ethnic and racial intermarriage. Loglinear and harmonic mean analyses of 
black-white intermarriage by Kalmijn (1993a) and Schoen & Wooldredge 
(1989) show that with respect to education, white women marry up more often 
when marrying a black man than when marrying a white man; similarly, black 
men marry down more often when marrying a white woman than when marry- 
ing a black woman. Similar conclusions apply when examining the marriage 
choices of white men and black women. White men marry down less often 
when marrying exogamously and black women marry up less often in mixed
marriages. These asymmetries in spouses' educational characteristics are as- 
sessed after controlling for the marginal educational distributions of race-sex 
groups and thereby support the hypothesis that majority men and women 
marry a minority spouse in part under the condition of socioeconomic status 
gains. 

While the pattern of black-white marriage provides support for the Davis- 
Merton hypothesis, studies of other types of ethnic homogamy are less consis- 
tent. In a harmonic mean analysis of Asian intermarriage in Hawaii, Schoen & 
Thomas (1989) show that after controlling for differences in educational distri- 
butions, white women marry up more often when they marry Filipino and 
Japanese males, consistent with the notion of exchange. The reverse is true, 
however, when white women marry Hawaiian or Chinese males. 

The exchange hypothesis has also been applied to other dimensions of part- 
ner choice, such as physical attractiveness and cultural participation. To exam- 
ine exchanges, studies generally rely on correlational analyses in which socio- 
economic characteristics of the husband are regressed on socioeconomic and 
noneconomic characteristics of the wife. Effects of the wife's noneconomic 
characteristics on the husband's socioeconomic characteristics are usually 
called crossing effects and are considered evidence for exchange. 

In an early analysis of physical attractiveness of women and occupational 
prestige of men, Taylor & Glenn (1976) show that female attractiveness has a 
positive effect on the occupational prestige of the man she marries, even when 
controlling for her own socioeconomic characteristics. A drawback of this 
analysis is that husband's attractiveness was not included in the model. If occupational
prestige and attractiveness are correlated within individuals, and if
people match in attractiveness, part of the effect of female attractiveness on 
male prestige may be spurious. A more recent analysis, which also controls for
the physical attractiveness of the husband, confirms this. Stevens et al (1990) 
find no effect of female attractiveness on husbands' education, suggesting that 
no exchanges are being made. 

Another example of exchange is that between socioeconomic status and 
participation in high culture. DiMaggio & Mohr (1985) find that participation 
of the wife in high culture has a positive effect on the educational level of the 
husband, net of the educational level of the wife. Because no measures of the 
husband's cultural participation were included in this model, the exchange ef- 
fect might again be due to homogamy with respect to cultural participation. An 
analysis for the Netherlands, however, shows that this is not the case. Uunk 
(1996) analyzes the correlation between the wife's cultural participation and 
the husband's educational level while controlling not only for the wife's edu- 
cational level but also for the husband's cultural participation. Uunk finds a 
significant partial association between female high culture and male educa- 
tion, providing support for the exchange hypothesis. 


CONCLUSION 


Sociological research on marriage choices has generated many insights into how
modern society is differentiated. In general, social groups in society appear 
closed, in the sense that men and women more often choose partners within 
their group than one would expect under random mating. Although some 
groups are more closed than others, examples of social groups who marry ex- 
ogamously have not (yet) been found. Research on intermarriage also reveals 
how societies change. Overall, ascribed bases of group membership have be- 
come less important, while achieved bases of group membership, and espe- 
cially those governed by education, have not lost salience. This is not to say 
that ascribed groups are mixing freely now. Ethnic, religious, and particularly 
racial boundaries still exist, but they are weaker than they used to be. 

The focus of the literature has largely been descriptive. Researchers have 
scrutinized a mass of data—coming from censuses, surveys, and marriage li- 
censes—and have studied many groups, several countries, and long periods of 
time. Because marriage patterns are telling indicators of how closed groups in 
a society are, the descriptive focus of the literature has much to say for it. 
Monitoring such a social indicator also requires a certain degree of standardi- 
zation, and in this respect, the literature has its shortcomings. Some studies are 
limited to percentages, which are heavily affected by relative group size, mak- 
ing it difficult to compare endogamy across groups. Other studies use loglinear 
models, and while these are an advance from a methodological point of view,
they have probably made the literature less accessible to a general audience 
than it deserves to be. In carrying on its descriptive mission, the literature 
would gain by using simple odds ratios next to percentages or loglinear models 
when describing the degree of endogamy of groups. A good example of such 
an approach is provided in Lieberson & Waters (1988). In describing trends, it 
would also be useful to focus on recently formed marriages rather than on the 
stock of marriages at a given point in time, largely because the aim is to present 
social indicators for a clearly defined period in time. 
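
The contrast between percentages and odds ratios can be shown with two hypothetical marriage tables. The counts below are invented for illustration and do not come from any study cited here. Each table cross-classifies husbands and wives as inside or outside a given group. The two tables share the same odds ratio, yet the percentage of in-group husbands who marry endogamously differs sharply because the groups differ in size.

```python
def endogamy_percentage(n_in_in, n_in_out):
    """Percentage of in-group husbands whose wife is also in the group."""
    return 100.0 * n_in_in / (n_in_in + n_in_out)


def endogamy_odds_ratio(n_in_in, n_in_out, n_out_in, n_out_out):
    """Odds ratio for a 2x2 husband-by-wife table.

    Arguments are counts of couples: (husband in, wife in),
    (husband in, wife out), (husband out, wife in), (husband out, wife out).
    """
    if n_in_out == 0 or n_out_in == 0:
        raise ValueError("odds ratio is undefined when a mixed cell is empty")
    return (n_in_in * n_out_out) / (n_in_out * n_out_in)


# Hypothetical counts of couples for a small and a large group.
tables = {
    "small group": (20, 20, 20, 940),
    "large group": (470, 100, 100, 1000),
}
for label, counts in tables.items():
    pct = endogamy_percentage(counts[0], counts[1])
    odds = endogamy_odds_ratio(*counts)
    print(f"{label}: {pct:.1f}% endogamous, odds ratio {odds:.1f}")
```

Because the odds ratio is unaffected by the marginal distributions, it allows endogamy to be compared across groups of different size while remaining simple to report.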

In a theoretical sense, there has also been progress in the field. There are 
many theories about partner choice, and such notions provide important clues 
about the causes of intermarriage and homogamy. In general, marriage pat- 
terns arise from three social forces: the preferences of individuals for resources 
in a partner, the influence of the social group, and the constraints of the mar- 
riage market. The multifaceted perspective that has been developed over the 
years gives sociological theorizing an edge over competing theories of mar- 
riage choice such as those developed by psychologists and economists. 

Considerable empirical evidence exists for these theoretical notions, but the 
integration of empirical and theoretical work is less than perfect. There are two 
basic problems in empirical work. First, many hypotheses are tested in an indirect
fashion. The role of third-party control, for example, is documented by
comparing ethnic groups or religious denominations, but little information is 
available on what these parties in fact are doing. Similarly, the trend towards 
increasing educational homogamy may point to heightened competition for 
economic resources on the marriage market, but this is an interpretation, not a 
test. A second and related problem in empirical work is that many of the ob- 
served regularities and relationships can be attributed to all three types of 
causes, while little is yet known about the relative strengths of these factors.
That more highly educated members of minority groups are less endogamous, 
for example, may be attributed to a universalistic attitude brought about by 
higher education, but can also be explained in terms of greater opportunities to 
meet outgroup members. In a similar fashion, residential segregation may fos- 
ter endogamy by lowering opportunities, but its effect may also be explained 
by different socialization practices in segregated areas. 

While it is clear that progress can be made in integrating theoretical and em- 
pirical work, this is not a straightforward task either. One possible solution is 
to shift the focus from the aggregate to the individual level. In the past, most 
studies have compared countries, groups, or time periods, but few studies have 
analyzed individual differences in intermarriage. The main advantage of an in- 
dividual approach is that it facilitates the inclusion of a range of covariates for 
each of the three elements of the theory. For example, one can focus on sociali- 
zation practices and characteristics of the parental home to test hypotheses 
about third-party control. One can also include sex-role attitudes or expectations
regarding paid and domestic labor later in life to examine the role of preferences.
Finally, one can include contextual variables, such as characteristics
of the settings in which young adults are embedded (schools, workplaces, 
neighborhoods) to assess the effects of local marriage markets on intermar- 
riage. 

While an individual design would facilitate the multivariate analyses that 
are needed to test hypotheses more directly and to compare the strength of al- 
ternative explanations, such a design has its problems as well. Because multi- 
variate analyses of marriage choices use individuals as the unit of analysis, 
they provide a one-sided view of marriage. It takes two to marry, and for that 
reason most authors have used loglinear or harmonic mean models. Such mod- 
els correctly use marriages as the unit of analysis, rather than individuals, but 
make it difficult to include multiple covariates in the model. From a methodo- 
logical point of view, such models are preferable, but if the prime concern is to 
test theories, their advantage is not so obvious. 


ABSTRACT


Work on ethnic and nationalist violence has emerged from two largely non- 
intersecting literatures: studies of ethnic conflict and studies of political vio- 
lence. Only recently have the former begun to attend to the dynamics of vio- 
lence and the latter to the dynamics of ethnicization. Since the emergent lit- 
erature on ethnic violence is not structured by clearly defined theoretical op- 
positions, we organize our review by broad similarities of methodological 
approach: (a) Inductive work at various levels of aggregation seeks to identify
the patterns, mechanisms, and recurrent processes implicated in ethnic
violence. (b) Theory-driven work employs models of rational action drawn 
from international relations theory, game theory, and general rational action 
theory. (c) Culturalist work highlights the discursive, symbolic, and ritualis- 
tic aspects of ethnic violence. We conclude with a plea for the disaggregated 
analysis of the heterogeneous phenomena we too casually lump together as 
"ethnic violence." 


INTRODUCTION 


The bloody dissolution of Yugoslavia, intermittently violent ethnonational 
conflicts on the southern periphery of the former Soviet Union, the ghastly 
butchery in Rwanda, and Hindu-Muslim riots in parts of India, among other
dispiriting events, have focused renewed public attention in recent years on 
ethnic and nationalist violence as a striking symptom of the “new world disorder.”

To be sure, measured against the universe of possible instances, actual in- 
stances of ethnic and nationalist violence remain rare. This crucial point is ob- 
scured in the literature, much of which samples on the dependent variable 
(Fearon & Laitin 1996) or metaphorically mischaracterizes vast regions (such 
as post-communist Eastern Europe and Eurasia in its entirety or all of sub-
Saharan Africa) as a seething cauldron on the verge of boiling over or as a tinderbox,
which a single careless spark could ignite into an inferno of ethnonational
violence (Bowen 1996, Brubaker 1998). Ethnic violence warrants our
attention because it is appalling, not because it is ubiquitous. 

Nonetheless, although measurement and coding problems prevent confi- 
dent calculations, two general features of the late modern, post-Cold War 
world—in addition to the particular traumas of state collapse in the Soviet and
Yugoslav cases—have probably contributed to a recent increase in the inci- 
dence of ethnic and nationalist violence and have certainly contributed to an 
increase in the share of ethnic and nationalist violence in all political vio- 
lence—that is, to what might be called the ethnicization of political violence. 
The first could be called "the decay of the Weberian state": the decline (un- 
even, to be sure) in states' capacities to maintain order by monopolizing the le- 
gitimate use of violence in their territories and the emergence in some re- 
gions—most strikingly in sub-Saharan Africa—of so-called quasi-states 
(Jackson 1990, Jackson & Rosberg 1982), organizations formally acknowl- 
edged and recognized as states yet lacking (or possessing only in small degree) 
the empirical attributes of stateness. 

The end of the Cold War has further weakened many third world states as 
superpowers have curtailed their commitments of military and other state- 
strengthening resources, while the citizenries—and even, it could be argued, 
the neighbors—of Soviet successor states are more threatened by state weak- 
ness than by state strength (Holmes 1997). Such weakly Weberian states or 
quasi-states are more susceptible to—and are by definition less capable of re- 
pressing, though not, alas, of committing—violence of all kinds, including eth- 
nic violence (Desjarlais & Kleinman 1994). Meanwhile, the stronger states of 
the West are increasingly reluctant to use military force—especially unilater- 
ally, without a broad consensus among allied states—to intervene in conflicts 
outside their boundaries (Haas 1997). As a result, weakly Weberian third 
world states can no longer rely on an external patron to maintain peace as they 
could during the Cold War era. 

The second contextual aspect of the post-Cold War world to highlight is the 
eclipse of the left-right ideological axis that has defined the grand lines of 
much political conflict—and many civil wars—since the French Revolution.
From the 1950s through the early 1980s, violence-wielding opponents of ex- 
isting regimes could best mobilize resources—money, weapons, and political 
and logistical support—by framing their opposition to incumbents in the lan- 
guage of the grand ideological confrontation between capitalism and commu- 
nism. Incumbents mobilized resources in the same way. Today, these incen- 
tives to frame conflicts in grand ideological terms have disappeared. Even 
without direct positive incentives to frame conflicts in ethnic terms, this has 
led to a marked ethnicization of violent challenger-incumbent contests as the 
major non-ethnic framing for such contests has become less plausible and 
profitable. 

Moreover, there may be positive incentives to frame such contests in ethnic 
terms. With the increasing significance worldwide of diasporic social forma- 
tions (Clifford 1994, Appadurai 1997), for example, both challengers and in- 
cumbents may increasingly seek resources from dispersed transborder ethnic 
kin (Tambiah 1986, Anderson 1992). And a thickening web of international 
and nongovernmental organizations has provided greater international legiti- 
macy, visibility, and support for ethnic group claims (normatively buttressed 
by culturalist extensions and transformations of the initially strongly individu- 
alist human rights language that prevailed in the decades immediately follow- 
ing World War II). This institutional and normative transformation at the level 
of what Meyer (1987) calls the “world polity” provides a further incentive for 
the ethnic framing of challenges to incumbent regimes. To foreshadow a 
theme we underscore later: Ethnicity is not the ultimate, irreducible source of 
violent conflict in such cases. Rather, conflicts driven by struggles for power 
between challengers and incumbents are newly ethnicized, newly framed in 
ethnic terms. 


Ethnicity, Violence, and Ethnic Violence 


Attempts to theorize ethnic and nationalist violence have grown from the soil 
of two largely nonintersecting literatures: studies of ethnicity, ethnic conflict, 
and nationalism on the one hand, and studies of collective or political violence 
on the other. Within each of these large and loosely integrated literatures, eth- 
nic and nationalist violence has only recently become a distinct subject of in- 
quiry in its own right. 

In the study of ethnicity, ethnic conflict, and nationalism, accounts of con- 
flict have not been distinguished sharply from accounts of violence. Violence 
has generally been conceptualized—if only tacitly—as a degree of conflict 
rather than as a form of conflict, or indeed as a form of social or political action 
in its own right. Most discussions of violence in the former Yugoslavia, for ex- 
ample, are embedded in richly contextual narratives of the breakup of the state 


426 BRUBAKER & LAITIN 


(Glenny 1992, Cohen 1993, Woodward 1995). Violence as such has seldom 
been made an explicit and sustained theoretical or analytical focus in studies of 
ethnic conflict [though this has begun to change with Lemarchand's (1996) 
work on Burundi, and Tambiah's (1996), Brass's (1997) and Horowitz's 
(forthcoming) work on ethnic riots]. 

In the study of collective or political violence, on the other hand, ethnicity 
figured (until recently) only incidentally and peripherally. In a number of in- 
fluential studies (e.g. Gurr 1970, Tilly 1978) ethnicity figured scarcely at all. 
Revealingly, Gurr used the general term “dissidents” to describe nongovern- 
mental participants in civil strife. Although the empirical significance of eth- 
nicity was recognized, its theoretical significance was seldom addressed ex- 
plicitly; it was as if there was nothing analytically distinctive about ethnic (or 
ethnically conditioned or framed) violence. Ethnicity thus remained theoreti- 
cally exogenous rather than being integrated into key analytical or theoretical 
concepts. 

In recent years, to be sure, a pronounced “ethnic turn” has occurred in the 
study of political violence, paralleling the ethnic turn in international relations, 
security studies, and other precincts of the post-Cold War academic world. 
But this sudden turn to ethnicity and nationality too often has been external and 
mechanical (Brubaker 1998). Although ethnicity now occupies a central place 
in the study of collective and political violence, it remains a “foreign body" de- 
riving from other theoretical traditions. It has yet to be theoretically digested, 
or theorized in a subtle or sophisticated manner. 

This suggests two opportunities for theoretical advance today—and in fact 
significant work is beginning to emerge in these areas. On the one hand, it is 
important to take violence as such more seriously in studies of ethnic and na- 
tionalist conflict. It is important, that is, to ask specific questions about, and 
seek specific explanations for, the occurrence—and nonoccurrence (Fearon & 
Laitin 1996)—of violence in conflictual situations. These questions and expla- 
nations should be distinguished from questions and explanations of the exis- 
tence, and even the intensity, of conflict. We lack strong evidence showing that 
higher levels of conflict (measured independently of violence) lead to higher 
levels of violence. Even where violence is clearly rooted in preexisting con- 
flict, it should not be treated as a natural, self-explanatory outgrowth of such 
conflict, something that occurs automatically when the conflict reaches a cer- 
tain intensity, a certain “temperature.” Violence is not a quantitative degree of 
conflict but a qualitative form of conflict, with its own dynamics. The shift 
from nonviolent to violent modes of conflict is a phase shift (Williams
1994:62, Tambiah 1996:292) that requires particular theoretical attention. 

The study of violence should be emancipated from the study of conflict and 
treated as an autonomous phenomenon in its own right. For example, to the extent
that ethnic entrepreneurs recruit young men who are already inclined toward
or practiced in other forms of violence, and help bestow meaning on that
violence and honor and social status on its perpetrators, we may have as much 
to learn about the sources and dynamics of ethnic violence from the literature 
on criminology (Katz 1988) as from the literature on ethnicity or ethnic con- 
flict. 

At the same time, the strand of the literature that grows out of work on po- 
litical violence and collective violence should take ethnicity and nationality 
more seriously. This does not mean paying more attention to them; as noted 
above, there has already been a pronounced ethnic turn in the study of political 
violence and collective violence. That political violence can be ethnic is well 
established, indeed too well established; how it is ethnic remains obscure. The
most fundamental questions—for example, how the adjective “ethnic” modi- 
fies the noun “violence”—remain unclear and largely unexamined. Sustained 
attention needs to be paid to the forms and dynamics of ethnicization, to the 
many and subtle ways in which violence—and conditions, processes, activities,
and narratives linked to violence—can take on ethnic hues.


Defining the Domain 


In reviewing emerging work in anthropology, political science, and to a lesser 
extent other disciplines as well as sociology, we immediately face the problem 
that there is no clearly demarcated field or subfield of social scientific inquiry 
addressing ethnic and nationalist violence, no well-defined body of literature 
on the subject, no agreed-upon set of key questions or problems, no established 
research programs (or set of competing research programs). The problem is 
not that there is no agreement on how things are to be explained; it is that there 
is no agreement on what is to be explained, or whether there is a single set of 
phenomena to be explained. Rather than confronting competing theories or ex- 
planations, we confront alternative ways of posing questions, alternative ap- 
proaches to or “takes” on ethnic and nationalist violence, alternative ways of 
conceptualizing the phenomenon and of situating it in the context of wider 
theoretical debates. In consequence, this review specifies the contours and at- 
tempts a critical assessment of an emergent rather than a fully formed litera- 
ture. 

What are we talking about when we talk about ethnic or nationalist violence?
The answer is by no means obvious. First, despite its seemingly palpable
core, violence is itself an ambiguous and elastic concept (Tilly 1978:174),
shading over from the direct use of force to cause bodily harm through the 
compelling or inducing of actions by direct threat of such force to partly or 
fully metaphorical notions of cultural or symbolic violence (Bourdieu & Wacquant
1992:167–74). But the difficulties and ambiguities involved in characterizing
or classifying violence (which we shall understand here in a narrow
sense) as ethnic or nationalist¹ are even greater. Although these difficulties
have yet to receive—and cannot receive here—the full exploration they de- 
serve, a few summary points can be made: 


1. The coding of past, present, or feared future violence as ethnic is not only an 
analytical but a practical matter. Violence is regularly accompanied by social
struggles to define its meaning and specify its causes, the outcome of
which—for example, the labeling of an event as a pogrom, a riot, or a rebel- 
lion—may have important consequences (Brass 1996b). 

2. Coding practices are influenced heavily by prevailing interpretive frames. 
Today, the ethnic frame is immediately and widely available and legiti- 
mate; it imposes itself on, or at least suggests itself to, actors and analysts 
alike. This generates a coding bias in the ethnic direction. A generation ago, 
the coding bias was in the opposite direction. Today, we—again, actors and 
analysts alike—are no longer blind to ethnicity, but we may be blinded by it. 
Our ethnic bias in framing may lead us to overestimate the incidence of eth- 
nic violence by unjustifiably seeing ethnicity at work everywhere and 
thereby artifactually multiplying instances of "ethnic violence" (Bowen 
1996). More soberingly, since coding or framing is partly constitutive of the
phenomenon of ethnic violence, not simply an external way of registering 
and coming to terms with it intellectually, our coding bias may actually in- 
crease the incidence (and not simply the perceived incidence) of ethnic vio- 
lence. 

3. With these caveats in mind, we define ethnic violence on first approxima- 
tion as violence perpetrated across ethnic lines, in which at least one party is 
not a state (or a representative of a state), and in which the putative ethnic 
difference is coded—by perpetrators, targets, influential third parties, or
analysts—as having been integral rather than incidental to the violence, that 
is, in which the violence is coded as having been meaningfully oriented in 
some way to the different ethnicity of the target. 


This preliminary definition allows us to exclude the violence between Ger- 
mans and Frenchmen on the Marne in 1914. Similarly, it allows us to exclude 
the assassination of Robert F. Kennedy, since the shooting was not interpreted 
in ethnoreligious terms as a Catholic being shot by a Muslim. But the defini- 
tion hardly allows us to define a focused domain of research. A great profusion 
of work—only a small fraction of which is engaged by most contemporary
analysts of ethnic violence—is related in one way or another to ethnic violence.


¹To avoid cumbersome repetition, we refer simply to “ethnic” rather than to “ethnic or
nationalist.” But we understand “ethnic” broadly as including “nationalist” (insofar as this latter
term designates ethnic or ethnocultural forms of nationalism, as opposed to purely “civic” or state-
centered forms of nationalism).

The range and heterogeneity of this work compel us to be highly selective
in our review. We have had to exclude many pertinent literatures, or at best
touch on them only in passing. These include literatures on pogroms (Klier & 
Lambroza 1992) and genocides (Dobkowski & Wallimann 1992); on antis- 
emitism (Langmuir 1990), Nazism (Burleigh & Wippermann 1991), fascism, 
and the radical right (Rogger & Weber 1965); on racial violence (Horowitz 
1983), race riots (Grimshaw 1969), and policing in racially or ethnically mixed 
settings (Keith 1993); on slavery (Blackburn 1997), colonialism (Cooper & 
Stoler 1997), third-world nationalist revolutions (Chaliand 1977, Goldstone et 
al 1991), and state formation [especially in contexts of encounters with abo- 
riginal populations (Bodley 1982, Ferguson & Whitehead 1992)]; on separa- 
tism (Heraclides 1990), irredentism (Horowitz 1991b), and the formation of 
new nation-states (Brubaker 1996); on xenophobia and anti-immigrant violence
(Bjørgo & Witte 1993), “ethnic unmixing” (Brubaker 1995, Hayden
1996), forced migration (Marrus 1985), and refugee flows (Zolberg et al 
1989); on religious violence (Davis 1973); on terrorism (Stohl 1983, Wald- 
mann 1992), paramilitary formations (Fairbanks 1995), and state violence 
(van den Berghe 1990, Nagengast 1994); on conflict management (Azar & 
Burton 1986) and peace studies (Väyrynen et al 1987); on the phenomenology 
or experiential dimensions of violence (Nordstrom & Martin 1992); and on 
rage (Scheff & Retzinger 1991), humiliation (Miller 1993), fear (Green 1994), 
and other emotions and psychological mechanisms (e.g. projection, displacement,
identification) implicated in ethnic and nationalist violence (Volkan
1991, Kakar 1990).² Clearly, this would be an unmanageable set of literatures
to survey. Moreover, most of these are well-established, specialized literatures 
addressing particular historical forms and settings of ethnic or nationalist vio- 
lence, whereas we have interpreted our task as that of bringing into focus a 
newly emerging literature addressing ethnic violence as such. For different 
reasons, we neglect the theoretically impoverished policy-oriented literature 
on conflict management, and for lack of professional competence, we neglect 
the psychological literature. 

Since the emerging literature we survey is not structured around clearly de- 
fined theoretical oppositions, we organize our review not by theoretical posi- 
tion but by broad similarities of approach. We begin by considering a variety 
of inductive analyses of ethnic and nationalist violence that build on statistical 
analysis of large data sets, on the extraction of patterns from sets of broadly 
similar cases, on controlled comparisons, and on case studies. We next consid- 
er clusters of theory-driven work on ethnic violence deriving from the realist 


²Citations here are merely illustrative; we have tried to cite relatively recent, wide-ranging, or
otherwise exemplary works, in which ample citations to further pertinent literature can be found. 
tradition in international relations, from game theory, and from rational choice
theory. We conclude by examining culturalist analyses of ethnic violence. 

We recognize the awkwardness of this organizing scheme. It is logically 
unsatisfactory, combining methodological and substantive criteria. It lumps 
theoretically and methodologically heterogeneous work under the loose rubric 
"inductive." It risks implying, incorrectly, that inductive work is not theoreti- 
cally informed, and that culturalist approaches are neither inductive nor theory 
driven. We nonetheless adopt this scheme in an effort to mirror as best we can 
the emerging clusters of work. 


INDUCTIVE APPROACHES 


Without questioning the truism that all research—and all phases of research 
(including data collection)—is theoretically informed, we can characterize the 
work grouped under this heading as primarily data-driven rather than theory- 
driven. This work seeks to identify the regularities, patterns, mechanisms, and 
recurrent processes comprising the structure and texture of ethnic violence in 
inductive fashion through the systematic analysis of empirical data. The data 
in question range from large sets of highly aggregated data through small-n 
comparisons to single case studies. Methods of analysis range from statistical 
analysis and causal modeling to qualitative interpretation. We organize our 
discussion by level of aggregation. 


Large Data Sets 


Gurr has been a leading figure in the study of political violence for three dec- 
ades and a pioneer in the statistical analysis of large data sets in this domain 
(1968). His first major work (1970) outlined an "integrated theory of political 
violence" as the product of the politicization and activation of discontent aris- 
ing from relative deprivation. Although ethnicity played no role in his early 
work, it has become central to his recent work (1993a, 1993b, 1994, Harff &
Gurr 1989, Gurr & Harff 1994). This work has been built on a large-scale data 
set surveying 233 “minorities at risk" that have (a) suffered (or benefited from) 
economic or political discrimination and/or (b) mobilized politically in de- 
fense of collective interests since 1945. For each of these “nonstate communal 
groups"—classified as ethnonationalists, indigenous peoples, ethnoclasses, 
militant sects, and communal contenders—Gurr and associates have assembled
and coded on ordinal scales a wide array of data on background characteristics
(such as group coherence and concentration), intergroup differentials
and discrimination, and group grievances and collective action. They then seek 
to explain forms and magnitudes of nonviolent protest, violent protest, and re- 
bellion through an eclectic synthesis of grievance and mobilization variables. 
This work sensitizes us to the sharply differing dynamics, configurations,
and magnitudes of ethnic violence across regions. This comparative perspec- 
tive is crucial, since violence in Northern Ireland or in the Basque region, 
while unsettling in the context of post-World War II Europe, can be placed in 
more benign perspective when compared to Burundi, Rwanda, Sri Lanka, or 
post-Cold War Bosnia, where killing is measured not in the hundreds and 
thousands but in the tens or even hundreds of thousands (Heisler 1990). The 
standardized data set built by Gurr and associates gives us little reason to believe
that the processes and mechanisms generating violence in Northern Ireland
are the same as those that drive the violence in Sri Lanka. It is not even
clear, as we shall suggest in the conclusion, that these are both instances of the 
same thing (i.e. ethnic violence). 

If for Gurr the unit of analysis is the group, for Olzak (1992), Tarrow 
(1994), and Beissinger (1998), the unit is the event. Assembling data on ethnic 
and racial confrontations and protests in the United States in the late nineteenth 
and early twentieth century, Olzak uses event history analysis and ecological 
theories of competition and niche overlap to show that the breakdown of ethnic 
and racial segregation, by increasing economic and political competition, trig- 
gers exclusionary collective action, including ethnic and racial violence. 
Beissinger, constructing a database on violent collective events in the disinte- 
grating Soviet Union and its incipient successor states, analyzes the highly 
clustered incidence of nationalist violence in the context of a larger cycle of 
nationalist contention. He shows that nationalist struggles turned increasingly 
violent (and increasingly assumed the form of sustained armed conflict) late in 
the mobilizational cycle, in connection with the contestation of republican 
(and incipient state) borders at a moment when effective authority (to the ex- 
tent it existed at all) was passing from the collapsing center to the incipient suc- 
cessor states. In part, Beissinger echoes the findings of Tarrow (1994, Della 
Porta & Tarrow 1986) concerning the tendency for violence in Italy to occur 
toward the end of a mobilizational cycle. Although not directly concerned with 
ethnicity, Tarrow's work—notably his finding that violence does not map di- 
rectly onto protest—has implications for the study of ethnic violence. In Italy, 
violence appears to increase when organized protest weakens. As mobilization
wanes, violence is practiced by splinter groups as the only way to cause disrup- 
tion. Although the dynamics of the two cases differ, both Beissinger and Tar- 
row analyze violence as a phase in a mobilizational cycle rather than as a natu- 
ral expression of social conflict or social protest. 


Case-Based Pattern Finding 


For the analysis of ethnic conflict and violence in postcolonial Africa and Asia, 
Horowitz (1985) remains the classic text. Seeking to extract patterns from sets 
of broadly comparable cases, he stresses the social psychological and cognitive
underpinnings as well as the richly elaborated symbolic dimensions of
violent ethnic conflict, giving particular emphasis to comparative, anxiety- 
laden judgments of group worth and competing claims to group legitimacy. 
At the same time, Horowitz has given systematic attention to the effects of institutions—notably
electoral systems, armed forces, and federalist arrangements—in
fostering or preventing violent ethnic conflict (1985:Parts 3–5,
1991c). His arguments concerning institutional design—notably the design of
electoral systems—in the context of post-apartheid South Africa (1991a) have
led to a lively debate with Lijphart (1990). 

More recently, Horowitz (forthcoming) has returned to an earlier (1973, 
1983) concern with ethnic riots. He analyzes the morphology and dynamics of 
the “deadly ethnic riot,” building inductively from detailed reports on a hun- 
dred riots, mainly since 1965, in some 40 postcolonial countries. Arguing for a 
disaggregated approach to ethnic violence, Horowitz distinguishes the deadly 
ethnic riot—defined as mass civilian intergroup violence in which victims are 
chosen by their group membership—from other forms of ethnic (or more or 
less ethnicized) violence such as genocide, lynchings, gang assault, violent 
protest, feuds, terrorism, and internal warfare. The deadly ethnic riot is marked 
by highly uneven clustering in time and space, relatively spontaneous charac- 
ter (though not without elements of organization and planning), careful selec- 
tion of victims by their categorical identity, passionate expression of inter- 
group antipathies, and seemingly gratuitous mutilation of victims. 

Using broadly similar inductive approaches, other scholars have addressed 
ethnic riots in recent years, chiefly in the South Asian context (Freitag 1989; 
Das 1990a; Spencer 1990; Pandey 1992; Jaffrelot 1994; Brass 1996a, 1997). 
The most sustained contribution in this genre is Tambiah's (1996) richly textured,
multilayered account. While distancing himself from a simplistic instrumentalist
interpretation of ethnic riots as the joint product of political manipulation
and organized thuggery, Tambiah devotes considerable attention to the
routinization and ritualization of violence, to the "organized, anticipated, pro- 
grammed, and recurring features and phases of seemingly spontaneous, cha- 
otic, and orgiastic actions" (p. 230), the cultural repertory and social infra- 
structure [what Brass (1996b:12) calls “institutionalized riot systems"] 
through which riots are accomplished. At the same time, however, reworking 
Le Bon, Canetti, and Durkheim, Tambiah seeks to theorize the social psycho- 
logical dynamics of volatile crowd behavior. 

Other works in the pattern-finding mode address not particular forms of 
ethnic violence (such as the deadly ethnic riot) in their entirety but rather (like 


³Working within a broadly similar theoretical tradition, Petersen (1998) argues that structurally
induced resentment, linking individual emotion and group status, best accounts for ethnic violence 
in a broad range of East European cases. 
Horowitz 1985) general mechanisms and processes that are implicated in ethnic
violence. As Blalock (1989) notes in a different context, such mechanisms
and processes, although not the immediate or underlying cause of violent conflicts,
do causally shape their incidence and modalities. Here we restrict our attention
to one class of such mechanisms and processes (albeit a large and important
one): to the ways in which inter-ethnic violence is conditioned and fostered
by intra-ethnic processes.⁴

One such mechanism involves in-group policing. As analyzed by Laitin 
(1995a), this involves the formal or informal administration of sanctions, even
violent sanctions, within a group so as to enforce a certain line of action vis-à- 
vis outsiders (who may be defined not only in ethnic terms but in religious, 
ideological, class, or any other terms). Practices such as “necklacing” in South 
African townships, "kneecapping" by the IRA, the execution of Palestinians 
alleged to have sold land to Israelis, and the killing of alleged "collaborators" 
in many other settings have attracted notoriety as techniques used by ethnona- 
tionalist radicals to maintain control over in-group followers. Pfaffenberger 
(1994), for example, shows how members of the dominant Tamil separatist 
group in Sri Lanka, the Liberation Tigers, have prevented young male Tamils 
from leaving Jaffna and murdered leaders of rival Tamil groups, dissidents 
within their own ranks, and civilian Tamils suspected of helping the Sinhalese. 

A second intragroup mechanism—and a classical theme in the sociology of 
conflict (Simmel 1955, Coser 1956)—involves the deliberate staging, instigation,
provocation, dramatization, or intensification of violent or potentially
violent confrontations with outsiders. Such instigative and provocative actions 
are ordinarily undertaken by vulnerable incumbents seeking to deflect within- 
group challenges to their position by redefining the fundamental lines of con- 
flict as inter- rather than (as challengers would have it) intragroup; but they 
may also be undertaken by challengers seeking to discredit incumbents. 

Gagnon’s (1994/1995) analysis of the role of intra-Serbian struggles in 
driving the bloody breakup of Yugoslavia is the most theoretically explicit re- 
cent contribution along these lines. Gagnon argues that a conservative coali- 
tion of party leaders, local and regional elites, nationalist intellectuals, and 
segments of the military leadership, threatened in the mid-1980s by economic 
crisis and strong demands for market-oriented and democratic reforms, pro- 
voked violent ethnic confrontation—first in Kosovo and then, more fatefully, 
in the Serb-inhabited borderland regions of Croatia—in a successful attempt to 
define ethnicity (specifically the alleged threat to Serb ethnicity) as the most 


⁴General mechanisms may, of course, be specified in a deductive as well as an inductive
manner. Although most of the work cited in the rest of this subsection is broadly inductive, we also
cite here for reasons of convenience a few deductive works. Deductive theorizing about general
mechanisms implicated in ethnic violence is considered in more sustained fashion in the next
section. 
pressing political issue and thereby to defeat reformist challengers and retain
their grip on power. Although Gagnon's empirical analysis is one-sided in its 
exclusive focus on the Serbian leadership (partially similar points could be 
made about the Croatian leadership), his theoretical argument on the within- 
group sources of intergroup conflict is valuable. In a broader study of national- 
ism and democratization, Snyder (1998) argues that such strategies of provo- 
cation are particularly likely to occur, and to succeed, in newly democratizing 
but institutionally weak regimes. Other instances of such cultivated confrontations
arising from intragroup dynamics are found in Deng's (1995) study of the
Sudan and Prunier's (1995) study of Rwanda. 

A third important intragroup mechanism is ethnic outbidding (Rabushka & 
Shepsle 1972, Rothschild 1981, Horowitz 1985:Chapter 8, Kaufman 1996). 
This can occur in a context of competitive electoral politics when two or more 
parties identified with the same ethnic group compete for support, neither (in
particular electoral configurations) having an incentive to cultivate voters of
other ethnicities, each seeking to demonstrate to their constituencies that it is
more nationalistic than the other, and each seeking to protect itself from the
other's charges that it is “soft” on ethnic issues. The outbidding can “o’erleap
itself” into violent confrontations, dismantling the very democratic institutions
that gave rise to the outbidding. This is a powerful mechanism (and a general 
one, not confined to ethnic outbidding). How it works is theoretically clear, 
and that it sometimes works to intensify conflict and generate violence was 
classically, and tragically, illustrated in Sm Lanka (Horowitz 1991c, Pfaffen- 
berger 1994). 

Yet outbidding does not always occur, and it does not always pay off as a 
political strategy when it is attempted. Contrary to many interpretations, Gag- 
non (1996) argues that the violent collapse of Yugoslavia had nothing to do 
with ethnic outbidding. In his account, Serbian elites instigated violent con- 
flict, and framed it in terms of ethnic antagonism, not to mobilize but to demo- 
bilize the population, to forestall challenges to the regime. When they needed 
to appeal for public support during election campaigns, elites engaged not in 
ethnic outbidding but in "ethnic underbidding," striving to appear more mod- 
erate rather than more radical than their opponents on ethnic issues. Further 
work needs to be done (following Horowitz 1985) in specifying the conditions 
(e.g. different types of electoral systems) in which such outbidding is more or 
less likely to occur, and more or less likely to pay off. 

A fourth intragroup mechanism concerns the dynamics of recruitment into 
gangs, terrorist groups, or guerrilla armies organized for ethnic violence. Al- 
though most ethnic leaders are well educated and from middle-class back- 
grounds, the rank-and-file members of such organizations are more often 
poorly educated and from lower or working class backgrounds (Waldmann 
1985, 1989; Clark 1984). Considerable attention has been focused on the inter- 
group dynamics that favor recruitment into such organizations. For example, 
interviewing IRA members, White (1993:Chapter 4) finds that many working- 
class Catholics joined the IRA after experiencing violence in their neighbor- 
hoods at the hands of British security forces and loyalist paramilitaries. We 
have little systematic knowledge, however, about the social and psychological 
processes within groups that govern the recruitment of young men (and, much 
more rarely, women) into disciplined, ethnically organized violence-wielding 
groups—processes such as the distribution of honor, the promising and provi- 
sion of material and symbolic rewards for martyrs, rituals of manhood, the 
shaming of those who would shun violence, intergenerational tensions that 
may lead the impetuous young to challenge overcautious elders, and so on. 


“Small-N” Comparisons 


Controlled comparisons have been relatively few, especially those comparing 
regions suffering from ethnic violence with regions in which similar ethnic 
conflicts have not issued in violence. The Basque/Catalan comparison is a 
natural in this respect and has been explored by Laitin (1995b), who focuses on 
linguistic tipping phenomena and the differential availability of recruits for 
guerrilla activity from rural social groups governed by norms of honor, and by 
Díez Medrano (1995), who focuses on the social bases of the nationalist move- 
ments. Varshney (1997) compares Indian cities that have similar proportions 
of Muslim and Hindu inhabitants and that share other background variables, 
yet have strikingly divergent outcomes in terms of communal violence. He ar- 
gues that high levels of “civic engagement” between communal groups ex- 
plain low levels of violence between Muslims and Hindus. Waldmann (1985, 
1989) compares the violent ethnic conflicts in the Basque region and Northern 
Ireland to the (largely) nonviolent conflicts in Catalonia and Quebec, and ex- 
plains the transition from nonviolent nationalist protest to violent conflict in 
the former cases in terms of the loss of middle-class control over the nationalist 
movement. Friedland and Hecht (1998) compare the violent conflicts for con- 
trol of sacred places in Jerusalem and the Indian city of Ayodhya. In both 
cases, they show, struggles over religious rights at sacred centers claimed by 
two religions—Jews and Muslims in Jerusalem, Hindus and Muslims in 
Ayodhya—have been closely bound up with struggles to establish, extend, or 
reconfigure nation-states. The comparison of Rwanda and Burundi is compel- 
ling because of stunning violence in both cases despite quite different histori- 
cal conditions. This comparison has not been analyzed systematically, but Le- 
marchand (1996) suggestively discusses the multiple ways in which the two 
cases have become intertwined. To be sure, the idea of controlling all relevant 
variables through a "natural experiment" is illusory. But Laitin (1995b) de- 
fends the exercise as worthwhile because it compels us to focus on specific 
processes under differing conditions, setting limits to overgeneralized theory. 


Case Studies 


In this domain as in others, “cases” continue to be identified generally with 
countries. Thus substantial literatures have formed around key cases such as 
Northern Ireland (McGarry & O'Leary 1995; Feldman 1991; Bruce 1992; Bell 
1993; White 1993; Aretxaga 1993, 1995); Yugoslavia (Woodward 1995, Co- 
hen 1993, Glenny 1992, Denich 1994, Gagnon 1994/1995); Sri Lanka (Kap- 
ferer 1988; Tambiah 1986; Kemper 1991; Pfaffenberger 1991, 1994; Spencer 
1990; Sabaratnam 1990); and Rwanda and Burundi (Lemarchand 1996, 
Prunier 1995, Malkki 1995). The identification of case with country, however, 
is a matter of convention, not logic. Ethnic or nationalist violence in a country 
is treated as a case when the violence is portrayed as a single processual whole. 
If the violence is instead construed as a set of separate (though perhaps interde- 
pendent) instances, then it becomes a case set, suitable for controlled compari- 
son or even for a large-n study. In Olzak's (1992) study of confrontations and 
protests, for example, the United States is not a case but the location for a 
large-n study of events. The breakup of Yugoslavia has most often been treated 
as a single complex interconnected case, but if we had adequately disaggre- 
gated data, it could be studied as a set of cases (for example, of recruitment to 
unofficial or quasi-official violence-wielding nationalist militias or gangs). 
Most case studies are organized around a core argumentative line. In Wood- 
ward’s 1995 analysis of Yugoslavia, for example, the cumulative effect of eco- 
nomic crisis, a weakening central state, and external powers' recognition of 
constituent “nations” that were incapable of acting like states created a secu- 
rity dilemma for minorities in the newly recognized states. For Deng (1995), 
the attempt by the North to identify the Sudanese nation as an Arab one could 
lead only to rebellion from the South, which had been enslaved by Arabs but 
never assimilated into an Arab culture. For Kapferer (1988), Sinhalese Bud- 
dhist myths and rituals—rooted in an embracing cosmology and “ingrained in 
the practices of everyday life” (p. 34)—provided a crucial cultural underpin- 
ning for a radically nationalizing Sinhalese political agenda and for anti-Tamil 
violence in Sri Lanka. In Prunier’s 1995 analysis of Rwanda, an externally im- 
posed ideology of sharp difference between Hutus and Tutsis, and postcolonial 
claims to exclusive control of the state on both sides of this colonially reified 
group difference, created a security dilemma favoring preemptive violence. 
At the same time, authors of these and other case studies recognize that the 
explanatory lines they highlight are partial, and they consequently embed 
these arguments in richly contextualized narratives specifying a web of inter- 
twined supporting, subsidiary, or qualifying arguments. As a result, one can- 
not evaluate these works on the same metric as one would the statistical or 
even the small-n studies. The rhetorical weight in case studies tends to be car- 
ried by the richness and density of texture; although a major argumentative 
line is almost always identifiable, the argument takes the form of a seamless 
web rather than a distinct set of explanatory propositions. Attempts to extract 
precise propositions from such case studies often reduce the original argument 
to the point of caricature.5 However, close reading of such works can yield 
rich material on microsocial processes at low levels of aggregation that macro 
theories miss. 


THEORY-DRIVEN RATIONAL ACTION APPROACHES 


The main clusters of theory-driven work on ethnic violence have employed 
models of rational action, drawing in particular from the realist tradition in in- 
ternational relations, from game theory, and from rational choice theory in 
general.6 Although “rational action” (or “strategy,” the preferred term of inter- 
national relations and game theory) is understood differently in these tradi- 
tions (referring in international relations to the grand designs of states engaged 
in power politics, in game theory to the fully specified plan for playing a par- 
ticular game, and in rational choice theory in general to individual action ori- 
ented to the maximization of subjective expected utility), ethnic violence in all 
three traditions is seen as a product of rational action (rather than emotion or ir- 
rationality), though structural background conditions are seen as crucially 
shaping the contexts of choice. 


International Relations Approaches 


International relations scholars of the realist school (Jervis 1978) posit the ex- 
istence of a "security dilemma" under conditions of anarchy in which even 
nonaggressive moves to enhance one's security, perceived as threatening by 
others, trigger countermoves that ultimately reduce one's own security. While 
formulated to explain interstate wars, the security dilemma has been applied to 
intrastate ethnic violence as well. 
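
As a rough illustration, the logic of the security dilemma can be written as a two-player game. The sketch below is a minimal example under assumed, purely illustrative payoffs (a prisoner's-dilemma ordering, which is only one of several ways the dilemma might be modeled). The action labels, the payoff numbers, and the helper `pure_nash_equilibria` are hypothetical and are not drawn from Jervis (1978).

```python
from itertools import product

ACTIONS = ("restrain", "arm")

# PAYOFFS[(a, b)] = (payoff to group A, payoff to group B).
# Illustrative assumption: arming against a restrained rival pays best (4),
# mutual restraint is second best (3), mutual arming is costly (1),
# and restraint against an armed rival is worst (0).
PAYOFFS = {
    ("restrain", "restrain"): (3, 3),
    ("restrain", "arm"): (0, 4),
    ("arm", "restrain"): (4, 0),
    ("arm", "arm"): (1, 1),
}


def pure_nash_equilibria(payoffs, actions=ACTIONS):
    """Return all action pairs from which neither side gains by deviating alone."""
    equilibria = []
    for a, b in product(actions, repeat=2):
        # Group A cannot do better by switching its own action, holding b fixed.
        a_ok = all(payoffs[(a, b)][0] >= payoffs[(alt, b)][0] for alt in actions)
        # Group B cannot do better by switching its own action, holding a fixed.
        b_ok = all(payoffs[(a, b)][1] >= payoffs[(a, alt)][1] for alt in actions)
        if a_ok and b_ok:
            equilibria.append((a, b))
    return equilibria


if __name__ == "__main__":
    print(pure_nash_equilibria(PAYOFFS))
```

Under these assumed payoffs the only pure-strategy equilibrium is mutual arming, which leaves each side with less than it would receive under mutual restraint. That is the sense in which moves made for one's own security can end up reducing it.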

A line of argument initiated by Posen (1993) focuses on the windows of op- 
portunity—and vulnerability—occasioned by the collapse of central authority 
in multiethnic empires (see also Carment et al 1997). In such circumstances, 
especially given an historical record of serious intergroup hostilities (ampli- 
fied and distorted, of course, in the retelling), groups are likely to view one an- 
other's nationalist mobilization as threatening. These perceived threats may 
create incentives for preemptive attack (or at least for countermobilization that 
will in turn be perceived as threatening by the other group, engendering a mo- 


5Blalock (1989:14) notes the degradation of propositional content that typically occurs once 
case studies get taken up in the literature and involved in theoretical controversy. 

6These clusters are not, of course, mutually exclusive. For example, Lake & Rothchild (1996) 
draw on elements from all three. 

bilization spiral that can lead to violence, especially since violent action can be 
undertaken autonomously, under conditions of state breakdown, by small 
bands of radicals outside the control of the weak, fledgling successor states). 
To be sure, the international relations perspective on ethnic violence has its 
weaknesses. Ethnic conflict differs sharply from interstate conflict (Laitin 
1995c). States are distinct and sharply bounded entities [though to treat them 
as unitary actors, as international relations scholars commonly do (Van Evera 
1994), is problematic (Mann 1993:Chapters 3, 21)]. In contrast, ethnic groups 
are not "given" entities with unambiguous rules of membership, as is well 
known from a generation of research (Barth 1967, Young 1965). Rarely is a 
single leader recognized as authoritatively entitled to speak in the name of the 
group. As a result, ethnic groups generally lack what states ordinarily possess, 
namely, a leader or leaders capable of negotiating and enforcing settlements 
(Paden 1990, Podolefsky 1990). Moreover, ethnic group membership is fluid 
and context-dependent. Relatively high rates of intermarriage (as in the former 
Yugoslavia) mean that many people, faced with interethnic violence, are not 
sure where they belong. Boundary-strengthening, group-making projects 
within ethnic groups are almost always central to violent conflicts between 
groups, but these crucial intragroup processes are obscured by international re- 
lations-inspired approaches that treat ethnic groups as unitary actors. 


Game Theoretic Approaches 


In examining ethnic violence, game theorists subsume the issue as part of a 
general theory of social order (Kandori 1992, Landa 1994). With specific ref- 
erence to ethnic violence, however, game theorists seek to understand the ra- 
tionale for the choice to use violence, assuming that violence will be costly to 
both sides in any conflict (Fearon 1995). They are not satisfied with theories, 
especially psychological ones (Tajfel 1978), that can account for conflict or 
mistrust but not for violence. Game theorists seek to provide a specific account 
of violence rather than accept it as an unexplained and unintended byproduct 
of tense ethnic conflicts. 

There is no unitary or complete game theory of ethnic violence. Rather, 
game theorists have identified certain general mechanisms that help account 
for particular aspects of the problem of ethnic violence. Here we review game- 
theoretic accounts of three such mechanisms, associated with problems of 
credible commitments, asymmetric information, and intragroup dynamics, re- 
spectively. 

Fearon (1994) has developed a model of the problem of credible commit- 
ments and ethnic violence. In this model, the problem arises in a newly inde- 
pendent state dominated by one ethnic group but containing at least one power- 
ful minority group as well. The model focuses on the inability of an ethnicized 
state leadership to “credibly commit” itself to protect the lives and property of 
subordinate ethnic groups, who, as a result, have an interest in fighting for in- 
dependence immediately rather than waiting to see if the leadership honors its 
commitment to protect them. Once a war breaks out, as Walter (1994) shows, 
settlement is extremely difficult, because neither side will want to disarm with- 
out full confidence that the agreement will be adhered to; but no one will have 
such confidence unless the other side disarms. Weingast (1998) shows that in- 
dividuals who are told by their group leaders that they are targets for extermi- 
nation would rationally take up arms even if the probability is negligible that 
their leaders’ prognostications are accurate, since a low probability event with 
drastic consequences has a high expected disutility. Therefore ethnic war can 
emerge from a commitment problem even if only vague suggestions of repres- 
sion exist, or if only a maniacal wing of the ruling group has genocidal inten- 
tions. Weingast’s work is sensitive to the importance of institutions such as the 
consociational ones described by Lijphart (1977) that enhance the credibility 
of commitments. In the absence of such institutions, ethnic violence is more 
likely to occur. 
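
The expected-disutility reasoning in the preceding paragraph can be made concrete with a small numerical sketch. All numbers are illustrative assumptions rather than values from Weingast (1998), and the function names `expected_utility_wait` and `should_take_up_arms` are hypothetical.

```python
def expected_utility_wait(p_attack: float, loss_if_victim: float, peace_payoff: float) -> float:
    """Expected payoff of not arming.

    With probability p_attack the group is victimized and suffers loss_if_victim;
    otherwise it enjoys peace_payoff.
    """
    return p_attack * (-loss_if_victim) + (1.0 - p_attack) * peace_payoff


def should_take_up_arms(p_attack: float, loss_if_victim: float,
                        peace_payoff: float, cost_of_fighting: float) -> bool:
    """Arm when the certain cost of fighting is smaller than the expected loss from waiting."""
    return -cost_of_fighting > expected_utility_wait(p_attack, loss_if_victim, peace_payoff)


if __name__ == "__main__":
    # A 1% chance of a catastrophic loss outweighs a sizable but bounded cost of fighting.
    print(should_take_up_arms(0.01, 10_000, 1, 50))
    # With the same 1% chance but a smaller loss, waiting is preferred.
    print(should_take_up_arms(0.01, 1_000, 1, 50))
```

In this sketch the decision turns on the product of probability and magnitude, not on probability alone, which is why even a negligible probability can justify arming when the feared outcome is drastic enough.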

Some scholars discount the credible commitments problem, arguing that 
many states do not even seek to make such commitments to protect their mi- 
norities. Rothchild (1991) shows that ethnic violence in Africa is associated 
strongly with regimes that show no interest in bargaining with disaffected 
groups. In many cases violence results neither from fear nor from failed coor- 
dination but from deliberate policy. However, if violence of this type were not 
reciprocated, and carried few costs for its perpetrators, it would be, in game- 
theoretic terms, a dominant strategy for leaders of ethnocratic regimes; and re- 
searchers must then explain why this sort of violence is not more common than 
it is. 

Concerning the problem of information asymmetry, Fearon & Laitin (1996) 
suggest, with Deutsch (1954), that ethnic solidarity results from high levels of 
communication. As a result, in everyday interaction within an ethnic group, if 
someone takes advantage of someone else, the victim will be able to identify the 
malfeasant and to refuse future cooperation with him or her. High levels of inter- 
action and of information about past interaction make possible the “evolution 
of cooperation” (Axelrod 1984) within a community. Interethnic relations, 
however, are characterized by low levels of information; the past conduct of 
members of the other ethnic group, as individuals, is not known. Under such 
conditions, an ethnic incident can more easily spiral into sustained violence, if 
members of each group, not being able to identify particular culprits, punish 
any or all members of the other group. This unfortunate equilibrium, Fearon 
and Laitin show, is not unique. They describe an alternate equilibrium, one that 
helps explain why violent spiraling, although gruesome, is rare. They find that 
even under conditions of state weakness or breakdown, ethnic cooperation can 
be maintained by local institutions of in-group policing—where leaders of one 
group help identify and punish the instigators of the violence against members 
of the other group—and intergroup mediation. The in-group policing equilib- 
rium is one in which interethnic violence can be cauterized quickly. 

Concerning in-group dynamics, game theory can help to clarify the micro- 
foundations for the intragroup processes discussed previously in the section on 
case-based pattern finding. Game theoretic approaches, attuned to the individ- 
ual level of analysis, do not assume—as do many theorists of ethnic con- 
flict—that members of ethnic groups share a common vision or common inter- 
ests. Kuran (1998a, 1998b) assumes that people have distinct preferences for 
some combination of ethnically marked and generic, ethnically indifferent 
consumption (including not only goods but activities, modes of association, 
policies, and so on). Ethnic entrepreneurs, who will be more successful to the 
extent that their constituents favor ethnic over generic consumption, try to in- 
duce the former at the expense of the latter. Such pressures, and constituents’ 
interdependent responses to them, can trigger ethnification cascades—sharp 
and self-sustaining shifts from ethnically neutral to ethnically marked activi- 
ties that divide once integrated societies into separate ethnic segments between 
whom violence is much more likely to flare up, and spread, than between the 
same individuals before the “cascade.” Laitin (1995b) uses a cascade model 
similar to that of Kuran. He assumes that ethnic activists, in the context of a na- 
tional revival, will use tactics of humiliation to induce co-nationals to invest in 
the cultural repertoires of the dormant nation. But when humiliation fails, and 
when activists fear that no cascade toward the national revival is possible, they 
will consider the possibility of inducing both intra- and interethnic violence. 
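
The interdependence that drives a cascade of this kind can be sketched with a simple threshold model. This is a generic threshold sketch offered for illustration, not Kuran's or Laitin's own formalization: each person is assumed to switch to ethnically marked activity once the share of the population already doing so reaches a personal threshold. The function name `cascade_share` and the threshold values are hypothetical.

```python
def cascade_share(thresholds, initial_share=0.0, max_rounds=1000):
    """Iterate a threshold model to a fixed point.

    thresholds: for each person, the population share of ethnically marked
    activity at or above which that person also switches.
    Returns the share of the population engaged once no one changes any more.
    """
    n = len(thresholds)
    share = initial_share
    for _ in range(max_rounds):
        new_share = sum(1 for t in thresholds if t <= share) / n
        if new_share == share:
            break  # fixed point: no one else is induced to switch
        share = new_share
    return share


if __name__ == "__main__":
    # Evenly spread thresholds: each switch induces the next person to switch.
    full = [i / 10 for i in range(10)]
    print(cascade_share(full))

    # Raise a single threshold from 0.1 to 0.2 and the chain breaks at once.
    stalled = [0.0, 0.2, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
    print(cascade_share(stalled))
```

The two populations differ in one person's threshold, yet the first tips completely while the second stalls with only the initial activist engaged. On this reading, pressure that lowers a few thresholds can be enough to set a self-sustaining shift in motion, and its failure leaves the population where it began.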


Rational Action Theory 


Rational action perspectives on ethnicity and nationalism have proliferated in 
recent years (Rogowski 1985, Meadwell 1989, Banton 1994). Yet despite an 
abundance of informal observations concerning the strategic, calculated, or 
otherwise instrumental dimensions of ethnic or nationalist violence, few sys- 
tematic attempts have been made (apart from the international relations and 
game-theoretic traditions mentioned above) to analyze ethnic and nationalist 
violence as such from a rational action perspective. One exception is Hechter 
(1995), who claims that “nationalist violence can best be explained instrumen- 
tally.” Hechter argues that while the dispositions linked to emotional or ex- 
pressive violence are distributed randomly in a population, and thus have no 
effect at the aggregate level, the dispositions underlying instrumental violence 
are clustered systematically and thus are decisive at the aggregate level. This 
argument presupposes that the dispositions underlying emotional or expres- 
sive violence are idiosyncratic individual characteristics, yet surely such pow- 
erful violence-fostering emotions as rage or panic-like fear may be clustered 
systematically at particular places and times and thus may be significant at the 
aggregate level. But Hechter does stake claim to territory into which rational- 
ists—for all their expansionist inclinations—have so far hesitated to tread. He 
also clearly states a series of propositions about the relation between group 
solidarity, state strength and autonomy, and oppositional nationalist violence. 
Another exception is Hardin (1995), who applies broadly rational choice per- 
spectives (following Olson 1975) to the formation of ethnic groups and their 
development of exclusionary norms and then relies on an informal game 
model to explain how groups with such norms can “tip” toward violence. 

Blalock's general theory of power and conflict (1989), though not specifi- 
cally addressed to ethnic or nationalist violence, analyzes structures, mecha- 
nisms, and processes that are often implicated in such violence. These include 
the small, disciplined "conflict groups" specifically organized to carry out vio- 
lence and the mechanisms through which protracted conflicts are sustained or 
terminated. He adopts a modified rational-actor perspective—modified in 
emphasizing structures of power and dependency and allowing for non- 
economic goals and the role of misperception, deception, ideological bias, 
and so on in shaping the subjective probabilities on the basis of which action is 
undertaken. 


CULTURALIST APPROACHES 


Culturalist analyses of ethnic and nationalist violence reflect the broader “cul- 
tural turn” the social sciences have taken in the past 20 years. Although such 
analyses are extremely heterogeneous, they generally characterize ethnic vio- 
lence as meaningful, culturally constructed, discursively mediated, symboli- 
cally saturated, and ritually regulated. Some culturalist analyses expressly re- 
ject causal analysis in favor of interpretive understanding (Zulaika 1988) or 
adopt a stance of epistemological skepticism (Pandey 1992, Brass 1997). Yet 
for the most part, culturalist accounts do advance explanatory claims, although 
the status and precise nature of the claims are not always clear. Here we sketch 
a few clusters of recurring themes in culturalist analyses. 


The Cultural Construction of Fear 


Like the rational action approaches just considered, culturalist approaches 
seek to show that even apparently senseless ethnic violence “makes sense” 
(Kapferer 1988) in certain contexts. Yet while they claim to discover a “logic” 
to ethnic and ethnoreligious violence (Spencer 1990, Zulaika 1988, Juergens- 
meyer 1988) and reject representations of it as chaotic, random, meaningless, 
irrational, or purely emotive, culturalists claim that such violence makes sense 
not in instrumental terms but in terms of its meaningful relation to or resonance 
with other elements of the culturally defined context. 

Culturalist analyses construe the relevant context in different ways. One 
major focus of attention has been on the cultural construction of fear, on the 
rhetorical processes, symbolic resources, and representational forms through 
which a demonized, dehumanized, or otherwise threatening ethnically defined 
"other" has been constructed. The social construction of fear, to be sure, is not 
a new theme in analyses of ethnic violence. It was central to Horowitz 
(1985:175-184), who in turn drew on a generation of work in social psychol- 
ogy. Yet while Horowitz sought to elaborate a universal “positional group psy- 
chology” to account for cross-cultural regularities in patterns of ethnic antipa- 
thy and anxiety, recent culturalist accounts have tended to emphasize particu- 
lar features of individual cultural contexts; they have emphasized the cultural 
and historical rather than social psychological grounding of ethnic fear. A lit- 
erature has emerged on the construction of fearful Hindu beliefs about Mus- 
lims in India (in the context of opposed ethnoreligious idioms and practices, 
religiously justified social segregation, and the rise of militant Hindu national- 
ism) (Gaborieau 1985, Pandey 1992, Hansen 1996); of Sinhalese beliefs about 
Tamils in Sri Lanka (in the context of an ethnocratic Sinhalese state, Tamil ter- 
rorism, state repression, and unchecked rumor) (Spencer 1990); and of Serbian 
beliefs about Croats in disintegrating Yugoslavia (in the context of a national- 
izing Croatian successor state symbolically linked to, and triggering memories 
of, the murderous wartime Ustasha regime) (Glenny 1992, Denich 1994). 
Once such ethnically focused fear is in place, ethnic violence no longer seems 
random or meaningless but all too horrifyingly meaningful. 

Without using the term, culturalist analyses have thus been concerned with 
what we discussed above as the security dilemma—with the conditions under 
which preemptive attacks against an ethnically defined other may “make 
sense.” Unlike the international relations approaches to the security dilemma, 
however—and unlike political and economic approaches to ethnic violence in 
general—culturalist approaches seek to specify the manner in which fears and 
threats are constructed through narratives, myths, rituals, commemorations, 
and other cultural representations (Atran 1990). Culturalist analyses thus see 
security dilemmas as subjective, not objective, and as located in the realm of 
meaning and discourse, not in the external world. Many cultural analyses (e.g. 
Tambiah 1996, Bowman 1994) acknowledge the crucial role of ethnic elites in 
engendering ethnic insecurity through highly selective and often distorted nar- 
ratives and representations, the deliberate planting of rumors, and so on. But 
the success of such entrepreneurs of fear is seen as contingent on the histori- 
cally conditioned cultural resonance of their inflammatory appeals; cultural 
“materials” are seen as having an inner logic or connectedness that makes them 
at least moderately refractory to willful manipulation by cynical politicians. 

Although such accounts may be plausible, even compelling “on the level of 
meaning” (Weber 1968:11), they have two weaknesses. The first is eviden- 
tiary: It is difficult to know whether, when, where, to what extent, and in what 
manner the posited beliefs and fears were actually held. How do we know that, 
in India, the most “rabid and senseless Hindu propaganda,” “the most outra- 
geous suggestions” about the allegedly evil, dangerous, and threatening Mus- 
lim “other,” have come to be “widely believed,” and to constitute “a whole 
new ‘common sense’” (Pandey 1992:42-43, Hansen 1996)? How do we know 
that, in Sri Lanka in 1983, Tamils were believed to be “superhumanly cruel 
and cunning and, like demons, ubiquitous” (Spencer 1990:619) or “agents of 
evil,” to be rooted out through a kind of “gigantic exorcism” (Kapferer 
1988:101)? How do we know that, in the Serb-populated borderlands of Croa- 
tia, Serbs really feared Croats as latter-day Ustashas? Lacking direct evidence 
(or possessing at best anecdotal evidence) of beliefs and fears, culturalist ac- 
counts often rely on nationalist propaganda tracts (Pandey 1992:43, Lemar- 
chand 1996:Chapter 2) but are unable to gauge the extent to which or the man- 
ner in which such fearful propaganda has been internalized by its addressees. 
[Malkki (1995) has attempted to document the extent of such internalization in 
her fieldwork among Hutu refugees from Burundi, but because this work con- 
cerns the victims of near-genocidal violence, not the perpetrators, it speaks 
most directly to the consequences rather than to the causes of ethnic vio- 
lence—although consequences of past violence can become causes of future 
violence in the course of a long-term cycle of intractable violent conflict (Le- 
marchand 1996, Atran 1990)]. 

The second problem is that such accounts (though culturalist accounts are 
not alone in this respect) tend to explain too much and to overpredict ethnic 
violence. They cannot explain why violence occurs only at particular times 
and places, and why, even at such times and places, only some persons partici- 
pate in it. Cultural contextualizations of ethnic violence, however vivid, are 
not themselves explanations of it. 


Framing Conflict as Ethnic 


In southern Slovakia in 1995, a pair of Hungarian youths were pushed from a 
train by Slovakian youths after a soccer match. Although one of the youths was 
seriously injured, and although the incident occurred after the Hungarians had 
been singing Hungarian nationalist songs, the violence was interpreted as 
drunken behavior by unruly soccer fans rather than as ethnic violence, and 
even the nationalist press in Hungary made no attempt to mobilize around the 
incident (Brubaker field notes). Similarly, the burning down of an Estonian 
secondary school in a predominantly Russian region of Estonia in 1995 was in- 
terpreted as a Mafia hit, even on the Estonian side, and no mobilization oc- 
curred, even though no one could suggest why the Mafia might have been in- 
terested in a secondary school (Laitin field notes). These incidents illustrate 
what we alluded to in the introduction as the constitutive significance of cod- 
ing or framing processes in ethnic violence. The "ethnic" quality of ethnic vio- 
lence is not intrinsic to the act itself; it emerges through after-the-fact interpre- 
tive claims. Such claims may be contested, generating what Horowitz (1991a:2) 
has called a metaconflict—a “conflict over the nature of the conflict” that may, 
in turn, feed back into the conflict in such a way as to generate (by furnishing 
advance legitimation for) future violence (Lemarchand 1996:Chapter 2, 
McGarry & O'Leary 1995). Such social struggles over the proper coding and 
interpretation of acts of violence are therefore worth studying in their own 
right (Brass 1996a, 1997; Abelmann & Lie 1995) as an important aspect of the 
phenomenon of ethnic violence. 


Gender 


Like other forms of violence and war, and like the phenomena of ethnicity and 
nationhood in general (Verdery 1994), ethnic and nationalist violence is 
strongly gendered. The Basque ETA and the Irish Republican Army (IRA), for 
example, are overwhelmingly male (Waldmann 1989:154, Zulaika 1988:182), 
although Aretxaga (1995:138) discusses women's efforts to be recognized as 
full members of the IRA rather than of its women's counterpart. As victims of 
ethnic violence, women are sometimes deliberately spared, at other times de- 
liberately targeted [for example, in the notorious mass rapes of Bosnian Muslim 
women by Bosnian Serbs (Korać 1994)]. More research is needed on the spe- 
cific roles that women may play in certain ethnic riots, not necessarily as direct 
perpetrators but, for example, in shaming men into participating (Hansen 
1996:153). Katz (1988) argues that while women as well as men are suscepti- 
ble to the “seductions of crime,” the characteristic modalities of women’s 
criminal activities are different; we might expect the same to be true of ethnic 
violence. 

The representation of ethnic violence is also strongly gendered. Recent re- 
search on nationalism shows that 1n many settings, prospective threats to (as 
well as actual attacks on) "the nation" are construed as a feared or actual viola- 
tion or rape of an "innocent, female nation" by a brutal male aggressor (Harris 
1993:170, Verdery 1994:248–249). To defend or retaliate against such threats
or attacks, conspicuously masculinist virtues may be asserted in compensatory 
or overcompensatory fashion. In India, for example, Hindu nationalist organi- 
zations offer a “way of recuperating masculinity” to their recruits, enabling 
them to “overcome the [stereotypically] ‘effeminate’ Hindu man and emulate
the demonized enemy, the allegedly strong, aggressive, militarized, potent and 
masculine Muslim" (Hansen 1996:148, 153). 


Ritual, Symbolism, Performance 


A number of analysts—echoing themes from the Manchester school of social 
anthropology (Gluckman 1954, Turner 1969)—have underscored the ritualized
aspects of ethnic violence. Gaborieau (1985) highlights “rituals of provocation,”
which he describes as “codified procedures” of deliberate disrespect,
desecration, or violation of sacred or symbolically charged spaces, times, or 
objects—in India, for example, the killing of cows by Muslims, or the distur- 
bance of Muslim worship by noisy Hindu processions (on noise as a cultural 
weapon in ethnoreligious struggles, see Roberts 1990). Marches and proces- 
sions through space “owned” by another group have triggered violence in 
Northern Ireland and India with sufficient regularity and predictability to warrant
calling these too rituals of provocation (Feldman 1991:29–30, Jaffrelot
1994, Tambiah 1996:240). Even without deliberate provocation, conflicting
claims to the same sacred spaces (Ayodhya, Jerusalem) or sacred times (when 
ritual calendars overlap) may provide the occasion for ethnic violence (Van 
der Veer 1994; Tambiah 1996:Chapter 9; Das 1990b:9ff; Friedland & Hecht 
1991, 1996). Freitag (1989) and Tambiah (1985:Chapter 4, 1996:Chapter 8)
have applied what the latter calls a semiotic and performative perspective on 
rituals and public events to ethnic confrontations, disturbances, and riots in 
South Asia. Performance and ritual are also emphasized in Zulaika's (1988) 
study of the cultural context of violence in a Basque village. Van der Veer 
(1996) sees riots as a form of ritual antagonism expressing an opposition be- 
tween the self and an impure, alien, or demonic "other." Following Davis's 
(1973) analysis of the "rites of violence" in the religious riots of sixteenth- 
century France, analysts of ethnic riots have called attention to the ritualized 
nature and symbolic resonance of the seemingly gratuitous forms of mutilation 
often involved (e.g. hacking off of body parts, desecration of corpses). 

Feldman's (1991) study of Northern Ireland is the most sustained discussion
of the symbolic dimension of ethnic violence. Feldman focuses on the ethnically
charged symbolism of urban space in Belfast, the increasing ethnic partitioning
of which is both a consequence of ethnic violence and a reinforcing
cause of future violence. He also analyzes the equally charged symbolism of 
the body. Ironically, given his critique of instrumental analyses of ethnic vio- 
lence, Feldman devotes a great deal of attention to the body as an instrument, 
as a weapon deployed by those (in his case, IRA prisoners) for whom it is the 
only resource. Of course, as he shows in rich detail, this instrumentalization of 
the body through the "dirty protest" (in which prisoners denied special politi- 
cal status refused to wear prison clothing and smeared feces on the walls) and 
the subsequent hunger strike (in which 10 prisoners died) was achieved in 
symbolically resonant form [analyzed also by Aretxaga (1993, 1995), the latter 
piece focusing on female prisoners’ own “dirty protest," centered on the dis- 
play of menstrual blood]. 

It should be emphasized that no serious culturalist theory today argues that 
violence flows directly from deeply encoded cultural propensities to violence 
or from the sheer fact of cultural difference. In this salutary sense, there are no 
purely culturalist explanations of ethnic violence; and it is difficult to simply
classify as culturalist a work such as Tambiah (1996), in which cultural, eco- 
nomic, political, and psychological considerations are deftly interwoven. By 
considering separately culturalist approaches, we do not imply that they are or 
ought to be segregated from other approaches. We suggest, rather, that such ap- 
proaches highlight aspects of ethnic violence—discursive, symbolic, and ritu- 
alistic aspects—that should ideally be addressed by other approaches as well. 


CONCLUSION: A PLEA FOR DISAGGREGATION 


The temptation to adopt currently fashionable terms of practice as terms of 
analysis is endemic to sociology and kindred disciplines. But it ought to be resisted.
The notion of “ethnic violence” is a case in point—a category of practice,
produced and reproduced by social actors such as journalists, politicians,
foundation officers, and NGO representatives, that should not be (but often is)
taken over uncritically as a category of analysis by social scientists. Despite 
sage counsel urging disaggregation (Snyder 1978, Williams 1994, Horowitz 
forthcoming), too much social scientific work in this domain (as in others) in- 
volves highly aggregated explananda, as if ethnic violence were a homogene- 
ous substance varying only in magnitude. To build a research program around 
an aggregated notion of ethnic violence is to let public coding—often highly 
questionable, as when the Somali and Tadjikistani civil wars are coded as ethnic—drive
sociological analysis.

The paradigmatic instances of ethnic and nationalist violence are large 
events, extended in space and time. Moreover, they are composite and causally 
heterogeneous, consisting not of an assemblage of causally identical unit in- 
stances of ethnic violence but of a number of different types of actions, pro- 
cesses, occurrences, and events. For example, it is evident from the case litera- 
ture that in Sri Lanka “ethnic violence” consists of episodic riots on the one 
hand and more continuous low-level terrorism (and state violence in response 
to the terrorism) on the other, all occurring against the background of the “cul- 
tural violence” perpetrated by a series of ethnocratic Sinhalese governments. 
Not only do the riots, terrorism, and state violence involve sharply opposed 
mechanisms and dynamics (in terms of degree and mode of organization, 
mode of recruitment and involvement of participants, affective tone, symbolic 
significance, contagiousness, degree and modality of purposeful rationality, 
and so on), but within each category there is also a great deal of causal hetero- 
geneity. Thus an ethnic riot typically involves at one level deliberate manipu- 
lation and organization by a small number of instigators but also, at other lev- 
els, turbulent currents of crowd behavior governed by powerful emotions and 
compelling collective representations requiring social psychological and cul- 
tural modes of analysis. 
There is no reason to believe that these heterogeneous components of large-scale
ethnic violence can be understood or explained through a single theoretical
lens. Rather than aspire to construct a theory of ethnic and nationalist violence—a
theory that would be vitiated by its lack of a meaningful explanandum—we
should seek to identify, analyze, and explain the heterogeneous processes
and mechanisms involved in generating the varied instances of what we
all too casually lump together—given our prevailing ethnicizing interpretive
frames—as "ethnic violence.” This can be accomplished only through a re- 
search strategy firmly committed to disaggregation in both data collection and 
theory building. 


ABSTRACT


This paper characterizes the field of sociological theory since the mid-1980s 
as the site of eight active and diverse intellectual projects. These projects are 
(I) to construct general analytical tools for use in empirical social research;
(II) to synthesize multiple theoretical approaches; (III) to refine existing
theoretical research programs; (IV) to stimulate dialogue among different
theoretical perspectives; (V) to enlarge and reconstruct current theoretical
approaches conceptually, methodologically, socially, and politically; (VI) to
analyze a range of past theoretical ideas; (VII) to offer a diagnosis of contemporary
social conditions; and (VIII) to dissolve the enterprise of sociological
theory. We discuss the contours of these projects and identify some of the
major ideas and theorists associated with each. We conclude with a brief discussion
of the organizational structure of the contemporary theory field, observing
that most current theoretical projects are formulated with insufficient
attention to their conditions of possibility.


INTRODUCTION 


Sociological theory has a long history of taking stock of itself. It lies among 
those distinctive academic specialties where a general review of the existing 
literature is not an occasional undertaking but part of the routine activity of 
many members of the area, a feature of every required survey course on theory, 
and a necessary component of every work purporting to be an original contribution
to the field.

Under these circumstances there exists no shortage of recent articles, mono- 
graphs, textbooks, and edited collections analyzing contemporary develop- 
ments in the theory field, a subject also addressed in previous volumes of the 
Annual Review of Sociology. Among these various contributions, Giddens & J 
Turner (1987), Ritzer (1988c, 1990b), B Turner (1996b), J Turner (1991), and 
R Wallace & Wolf (1995) all discuss trends in the theory area; the theorists, 
texts, and intellectual currents making up these trends; and the substantive ar- 
guments and conceptual frameworks associated with these developments. 

This chapter does not substitute for these detailed reviews. Its primary aim 
is not to resume the conceptual schemes and substantive arguments found in 
contemporary theories but to identify and briefly overview some of the major 
intellectual projects currently under way in the theory area. In research on ear- 
lier historical periods, we have shown that attention to a theorist's intellectual 
objectives and purposes increases understanding of the conceptual tools and 
empirical claims he or she articulated (e.g. Camic 1989, Gross 1997). In speak- 
ing in this chapter of "projects," we make a preliminary effort to extend this 
approach to the contemporary period by focusing on some principal tasks that 
those in the theory area are now seeking to accomplish, on the organizing pro- 
grams or agenda that contributors to the field are developing their concepts and 
arguments in order to advance, and on the descriptive and evaluative assessments
of the present state of the theory area that frequently accompany these
programs. 

We begin by identifying eight such projects and conclude with a discussion 
of their organizational conditions of possibility. All eight projects are contem- 
porary, found in the theory literature in the period on which we concentrate, 
the mid-1980s to the present. Analysis of other periods would necessarily pro- 
duce a different set of programs, whose degree of overlap with the following 
listing would be an empirical question. Throughout, we deal primarily with 
American developments, generally incorporating works by European theorists 
only insofar as these have appeared in English since the mid-1980s and have 
since then figured consequentially into American discussions. Even taking 
into account these restrictions, this survey of the literature is selective, and the 
eight projects do not provide an exhaustive charting of lines of current theo- 
retical activity. 

Furthermore, it should be recognized that these eight projects are not, as a 
rule, mutually exclusive of one another. There are theorists who have pursued 
more than one, both simultaneously and in different career stages, as we occa- 
sionally note, though it is not possible here to devote sufficient attention to par- 
ticular individuals fully to characterize their projects over time or even in sin- 
gle works. We also emphasize that our eight project categories are internally 
heterogeneous and that they bring together theorists who differ in other respects
while separating those often grouped as similar in other scholarly classifications.
But this simply testifies to the value of multiple classification
schemes: to the fact that thinkers with a common agenda for sociological the- 
ory may differ in the concepts and arguments they use to advance this agenda, 
just as theorists who converge in argument and concept may use these means 
to advance very different projects within the theory area. 

In discussing contemporary sociological theory, we consider an area with 
notoriously fuzzy boundaries. In identifying what works fall within these 
boundaries, we borrow Wittgenstein's notion of “family resemblances.” Wittgenstein
(1953:31–33) analyzes groups of elements that “have no one thing in
common which makes us use the same word for all” elements; just as there are 
“various resemblances between members of a family, build, features, colour of 
eyes, gait, temperament, etc. etc.," so there are "family resemblances" among 
the elements of these other groups, i.e. “a complicated network of similarities
overlapping and criss-crossing: sometimes overall similarities, sometimes
similarities of detail.” It is such family resemblances that constitute the contemporary
theory field: criss-crossing similarities in terms of analytical issues
and problems, intellectual ancestry and points of departure, vocabulary and
style of argument, self-identification (calling one's own work “theory”), institutional
membership (belonging to theory sections of sociological associations),
group adoption (having one's contribution embraced by theorists), and
more. No fixed cluster of these traits defines a family member nor makes it 
possible to track down all the stepchildren, distant cousins, and black sheep. 
Nonetheless, after one has spent some time among family members, it is not 
difficult to recognize the different branches of the family tree. 


EIGHT PROJECTS FOR CONTEMPORARY 
SOCIOLOGICAL THEORY 


Project I: Construction of General Tools for Use in 
Empirical Analysis 


For one group of contemporary theorists, the principal task of sociological theory
is to build analytical tools—concepts, explanatory propositions, interpretive
guidelines, etc—directly applicable in the study of empirical problems.
This project, according to many of its recent proponents, is increasingly threat- 
ened by abstract, self-referential theorizing that distances itself from the sub- 
stantive issues that arise in areas of empirical social research. 

This is the position of Chafetz: “[S]ociological theory is integrally related
to research” (1988:2) and “ghettoizes” itself when it retreats into “abstract
epistemological and ontological [discussion of issues such as] agency/micro
and structure/macro” (1993:60–62). Opposing this retreat, Chafetz sets for
theory the task of developing a “diverse set of practical tools”—general explanatory
statements and the concepts they contain—“from which one can select
those most helpful in solving any given [empirical] problem.” She carries
this agenda forward by articulating a multivariable, “eclectic structural theory”
of the causes of gender stratification (1988:1, 51–54).

Along similar lines, Rule (1997) objects that the theory field is now a “cacophony”
of discordant approaches (rational choice, network analysis, etc),
each of limited range, and has lost sight of the discipline's “perennial issues”—its
substantive questions about deviance, economic growth, civil violence,
etc. For Rule, “the development of analytical tools” that address such
questions is the principal task for contemporary theory, a task he himself takes 
up by formulating general propositions about the sources of civil violence 
(1997:1, 5, 19, 261—17; see also Rule 1988, Skocpol 1986). Calhoun, too, calls 
for “more studies that seek to advance theory in the cause of contemporary sociological
research and understanding, as distinct from those which aim
mainly at clarifying what [theorists] have already said" (1997:2). His prefer- 
ence, however, is for theory to supply not a set of general propositions and con- 
cepts but culturally sensitive "guidelines" or “frameworks for interpretation" 
of changing social practices (Calhoun 1996b:86, 92; see also Calhoun 1995, 
1996a). 

Pierre Bourdieu's writings (1988, 1990a, 1990b, 1993a, 1993b, Bourdieu &
Wacquant 1992) offer perhaps the most elaborate case for this project. Rejecting
“empty theoreticism”—general “programmatic discourse that is its own
end," focused on other abstract theories and unwilling “to sully [its] hands in 
empirical research”—Bourdieu’s agenda is to develop a “set of conceptual 
tools and procedures for constructing objects and for transferring knowledge 
[from] one area of inquiry into another" (Bourdieu 1988:774, 777, Bourdieu 
1993b:45, Bourdieu & Wacquant 1992:5). Unlike other advocates of this pro- 
gram, however, Bourdieu does not envision these tools as a broad, eclectic as- 
semblage suited to piecemeal appropriation. He concentrates instead on a lim- 
ited set of concepts: most famously, “habitus,” the "ensemble of dispositions" 
toward action and perception that operates from within social agents; and 
“field,” the configuration of relations between social positions, or the struc- 
tured space where social struggles unfold (Bourdieu & Wacquant 1992: 
16–19). Insisting on the “two-way relationship between habitus and field,”
Bourdieu seeks to transcend the hoary intellectual antinomy between objectiv- 
ism and subjectivism by "integrating into a single model the analysis of the ex- 
perience of social agents and the analysis of the objective structures that make 
this experience possible" (1988:782–84) and by then deploying this model in
empirical studies of spheres ranging from art and science to the economy and 
law. But he does not propose this model as a general “systematization” or a
universal “discourse on the social world”; it is a temporary “machine for research”
that “accomplishes and abolishes itself in the scientific work it has
helped produce” (1993b:29; Bourdieu & Wacquant 1992:159–61; for discussion,
see Calhoun et al 1993, Swartz 1997).

In contrast, W Wallace believes it is at last time for the theorist to codify a 
general “metalanguage,” “a single conceptual matrix”—consisting of eight ba- 
sic descriptive and twelve explanatory variables—for use throughout sociol- 
ogy, the basis for a "discipline-wide consensus" (1988:60). With this program, 
he veers toward Project II, though Wallace makes no claims for a theoretical 
synthesis. He regards his matrix as furnishing only a general "nomenclature" 
for empirically oriented work, i.e. for the creation of “many kinds of descrip- 
tions and explanations" (1988:60, 1983:9). 


Project II: Synthesis of Multiple Theoretical Approaches 


The analysis of existing social theories that is often criticized by proponents of 
Project I actually animates a second contemporary project. This project rests 
on the conviction that it is now possible to achieve a comprehensive synthesis 
of previously divergent theoretical perspectives. According to some theorists, 
such a synthesis is well under way; for others, it is a vital opportunity now to be 
seized. 

A decade ago, Ritzer (1988a, 1990a) forecast a coming era of theoretical
synthesis, and today efforts toward synthesis can be found in various quarters:
in Scheff’s program to “assimilate [contending] theoretical proposals in the
human sciences...within a much larger matrix” by use of a “micro-linguistic
analysis of discourse” situated in “the context of larger wholes” (1997:7–10,
see also 1990); in Runciman’s macro-historical “theory of social relations, so- 
cial structure, and social evolution,” offered as a “general synthesis” aiming to 
“do for the study of societies what Darwin [did] for the study of species” 
(1989b:60, 449; 1989a:13; see also 1983); and in Emirbayer’s recent “mani- 
festo for a relational sociology” that tries to rework “micro” and “macro” and 
to forge a “unitary frame of reference” upon which diverse social thinkers are 
said to be “fast converging” (1997:311–12; Emirbayer & Goodwin 1994).

The project of theoretical synthesis has also been embraced by some of the 
most widely discussed figures in contemporary theory. Included here are those 
broadly sympathetic with the research focus of Project I. J Turner, for exam- 
ple, has urged that steps be taken to break down the barriers that divide theo- 
rists and to advance toward a “theoretical synthesis” at the macro, meso, and 
micro levels—especially the last, where he has developed a synthetic theory of
social interaction (1987, 1988, 1991). Recognizing that a theory unifying all 
levels may be far off, Turner believes that, at each level, “existing theories 
have captured many of the operative dynamics of the social universe,” bring- 
ing sociology near to empirically testable general theory of the type found in 
the natural sciences: i.e. to timeless principles that cut across substantive areas
(1989:17, 1991:591). He fears, however, that this goal may be defeated by the 
“anti-positivism” of other theorists and by various organizational divisions 
within sociology (1990, 1992; S Turner & J Turner 1990). Somewhat more op- 
timistic, R Collins envisions a “comprehensive theory [for] every arena of so- 
ciety"—theory in the form of "generalized explanatory models” of basic so- 
cial processes—though this will not take shape immediately (1986a:1351, 
1989:124, 1981b, 1992). Given the discipline's current fragmentation, the
theorist's present task is to promote “mutual confrontation” of available theoretical
traditions and to bridge productive areas of substantive research in order
“to compare, to synthesize, and to cumulate”—as Collins has done in identifying
power and status as central dimensions of all interaction and in proposing
“interaction ritual chains” as part of the micro-foundation of macro-structure
(1994:295, 1986a:1355, 1981a; Kemper & R Collins 1990). (See also
the program for a unified "comprehensive theory" proposed by Fararo 1989a, 
1989b; Fararo & Skvoretz 1986; Skvoretz & Fararo 1996.) 

Important variants of the synthesis project have also appeared among those 
for whom theory has greater autonomy from empirical research. Equating the- 
ory with general “presuppositions” about human action and social order, Alex- 
ander has argued for the imminent convergence at this level of all major—and 
hitherto one-sided—theories, classical and contemporary, announcing that
"synthetic rather than polemical theorizing is now the order of the day" 
(1988d:77). As to the content of this "new movement," Alexander has vari- 
ously predicted and promoted the following: a "multidimensional" synthesis of 
“normative and instrumental" views of action, “material and ideal” views of 
order (1982–1983); a Parsons-inspired “neofunctionalism” that “relink[s] theorizing
about action and order, conflict and stability, structure and culture” (Alexander
& Colomy 1990:57, Alexander 1985); a post-Parsonsian “micro-macro
synthesis” integrating “action and structure, subjectivity and objectivity” (Alexander
& Giesen 1987:4; Alexander 1987:376–77, 1988b,d); and a "new form
of synthetic social theory" that moves beyond neofunctionalism and carries the 
micro-macro synthesis in the direction of "culturalist theories opened up to the 
model of culture-as-language" (Alexander 1998:288, 1988c, 1995). Along 
parallel lines, Münch's earlier call ([1982]1987, [1982]1988) for a “synthesis”
based on Parsonsian action theory has become an agenda for a “new synthesis" 
connecting "Parsonian theory and competing theoretical approaches" that treat 
the economics, politics, and symbolics of action (1987:149, 1994). 

Giddens (1984, 1987, 1995, 1996) has pursued a similar cause, forecasting 
a revolutionary “new synthesis"—open to work outside sociology and geared 
to the world society of the next century—to replace the competing theories of 
the past. Key to this change, in his view, is theoretical advance, seen not as cumulation
of natural science-like generalizations but as “conceptual innovation.”
To this end, he faults structuralism and functionalism for overlooking
that actors are “vastly skilled in the practical accomplishments of social activi- 
ties," while also criticizing phenomenology and ethnomethodology for ne- 
glecting the "structural constraint" that affects such activities (1987:43, 1984: 
xxvii, 26). He proposes "structuration theory" as the alternative that both rec- 
ognizes the “duality of structure"—the fact that “the structural properties of 
social systems [i.e. their rules and resources] are both medium and outcome of
the practices they recursively organize"—and problematizes, for “specific his- 
torical circumstances, the relation between knowledgeable activity . . . and so- 
cial reproduction brought about in an unintended fashion" (1984:25, 1996:72; 
for discussion, see Cohen 1989). More ambitiously still, Luhmann (1985, 
1987, 1995), alarmed over sociology's “theory crisis,” draws on work in biology
and cybernetics to fashion what he sees as a unified theory of self-referential,
or autopoietic, social systems. Conceiving such systems (societies,
interactions) to be composed not of agents or actions but of communication, 
and analyzing how these systems evolve or adapt with respect to their environ- 
ments through meaningful communication, Luhmann views this approach as 
providing the needed basis for a “comprehensive theory of the social," a theory 
of "everything social" (1995:xxxvii, xlvii). 

The paradox of these diverse synthetic projects has not escaped notice. Uni- 
fied though they are on a program of synthesis, they generally diverge on the 
substance of this program, each thereby undercutting the other's case for the 
reality of the particular synthesis in question. To be sure, there are terminological
overlaps (agency/structure, micro/macro), and overcoming these divisions
is a concern shared by several (though not all) projects, much the way calls to 
reconcile order/change or consensus/conflict were commonplace a generation 
ago. But even strong advocates of synthesis have conceded that current theories
use terms like micro and macro very differently and may not “offer adequate
bases for dealing with micro-macro integration” (Ritzer 1988b:705–6).
Surveying the plurality of options, S Turner & J Turner (1990:170) flatly state
that “recent synthetic theories...have not been successful” at theoretical unification,
while Holmwood & Stewart (1994)—in a kind of reversion to Project
I—take failed syntheses as an occasion to broach new explanatory problems.


Project III: Refinement of Theoretical Research Programs 


For observers like B Turner (1989, 1996a), the heterogeneity of synthetic theories
makes “fragmentation and division rather than successful accumulation
[the] dominant trend in sociological theory," though there are exceptions to 
this pattern. They are found in pockets of the discipline once called 
"schools"—i.e. in areas committed to the ongoing elaboration of a single theo- 
retical orientation closely tied to an empirical research tradition (B Turner 
1996a:9–10). The continuing development of several of these “theoretical research
programs” (to borrow a term from D Wagner and Berger 1985) constitutes
a third major project in the field of contemporary theory.

Visible among these approaches has been rational-choice theory. The rubric
encompasses different positions, some suggestive of the synthetic goals of
Project II (see Abell 1996a, Coleman 1989, 1993, Lindenberg 1986). Typi- 
cally, however, the approach offloads many topics onto other theories to con- 
centrate on constructing formal models of the processes by which interdepend- 
ent, utility-maximizing actors "combine to produce social outcomes"; it then 
seeks to elaborate these models and extend their empirical range (Coleman & 
Fararo 1992a:xi-xii; Coleman 1990; Hechter 1983; Abell 1996b). With this in 
view, rational-choice theorists are now working to incorporate "social struc- 
ture into models of individual choice" (Macy & Flache 1995) and to use these 
models in areas where they would seem inapplicable, e.g. in studies of family, 
religion, and gender, and in research on social solidarity (Hechter 1983, 1987; 
Hechter & Kanazawa 1997; for criticism, see Barnes 1995, Coleman & Fararo 
1992b, England & Kilbourne 1990, Smelser 1998). A similar theoretical project
is also being cultivated within Analytical Marxism. Here formal rational-choice
models are again embraced and elaborated, but now as part of a program
to bring Marxism out of the nineteenth century, to clarify its core concepts,
to place its “macro-structural” theses on “micro-foundations” that specify
their causal mechanisms, and to apply this perspective in research on class
structure, exploitation, etc (Wright 1989:47–49; Elster 1985; Mayer 1994;
Roemer 1986; for critique, see Burawoy 1986, 1989). 

Another current focus for the project of theoretical refinement is expec- 
tations-states theory, which has continued its effort to formulate concepts and 
general propositions to describe and explain the status structure of small 
groups, to extend these ideas to further aspects of group interaction processes, 
and to test these formulations empirically (Berger et al 1989, Berger & 
Zelditch 1993, Fisek et al 1995, D Wagner & Berger 1985). This program has 
spawned several major subprograms, plus active attempts to integrate these, 
each new step hailed as calling for “more explicit formalization of the theory” 
and "research to test its more subtle implications" (D Wagner & Berger 
1993:48). In these ways, the expectations-states program mirrors develop- 
ments in other formal approaches to the analysis of interaction, such as ex- 
change theory, network analysis, and projects combining the two (see Burt 
1992, Cook 1987, Cook & Whitmeyer 1992, White 1992; cf Blau 1994). 

Other elaborative projects are under way in connection with less formal 
theoretical approaches. Rebounding from its relative quiescence during the 
1960s and 1970s, symbolic interactionism, for example, is now in a state of
“revitalization” and “ferment,” retaining some of its traditional concern with
the concrete interactional basis of meaning, identity, and self, but moving in
new directions: on the one hand by diversifying and “turn[ing] increasingly toward
power, history, and distinctive versions of social structure” as subjects
for interactionist research and theorizing (Plummer 1996:236–38); and on the
other hand by bridging outward (as more tightly knit approaches such as rational
choice have yet to do) to connect with other contemporary perspectives,
notably cultural studies and postmodernism (Denzin 1992, Fine 1993, Goff- 
man 1983, Joas 1987, Becker & McCall 1990, Stryker 1987). In ethnomethod- 
ology, a diversifying but continually developing theoretical research program 
is also clearly in evidence (Maynard & Clayman 1991). 


Project IV: Dialogue Among Multiple Theoretical Approaches 


If the multitude of positions found within the theory area has led numbers of 
theorists to decry fragmentation and, as an alternative, either to seek shelter in 
empirical research or to envision an all-encompassing synthesis or to cleave to 
a particular theoretical approach, a different reaction animates a fourth project. 
In this case, the presence of multiple approaches is welcomed as a potential op- 
portunity for fruitful theoretical dialogue. 

This program has affinities with the pluralist agenda articulated a genera- 
tion ago by theorists like Merton who held that “it is not so much the plurality 
of paradigms as the collective acceptance by practicing sociologists of a single 
paradigm proposed as a panacea that would constitute a deep crisis with ensu- 
ing stasis" (Merton 1975:29; cf Eisenstadt & Curelaru 1976, Rorty 1979). But 
current dialogic projects seek not only to emphasize the functions of theoreti- 
cal diversity but also to recognize and promote combinations and interactions 
of different theoretical perspectives—an objective that may be understood in 
different ways. 

In some contemporary projects, for example, theoretical dialogue is viewed 
as a means to other kinds of programmatic ends, including tool construction 
(Rule 1997), broad (or partial) theoretical synthesis (R Collins 1994, Münch 
1994), and theoretical reconstruction (Calhoun 1995; see also Bauman 1992). 

Beyond such instrumental uses of discourse, however, dialogue has also 
been recently elaborated into a project in and of itself by Mouzelis (1995), 
Wiley (1990), and most fully, Levine (1986, 1995). Believing that communi- 
cation—calmly “grasping truly alternative points of view"—is “the healing re- 
sponse" to cries of fragmentation, Levine sets for theorists the task of cultivat- 
ing the “spirit of dialogue,” especially by “tak[ing] stock of our manifold 
forms of social knowledge" and thus building a “common vocabulary that so- 
cial scientists can use to engage in constructive conversations about their dif- 
ferences” while “opening [themselves] to a wider range of options” (1995:2, 
297, 305, 324, 329). This project underlies Levine's (1985, 1991) own efforts 
to examine the dialectic interplay between the ideas of Simmel and Parsons, 
and it can be seen as well in the work of scholars carrying forth the Marx- 
Weber dialogue (Antonio & Glassman 1985, Wiley 1987). 


Project V: Enlargement/Reconstruction of Current 
Theoretical Approaches 


For many contemporary theorists the range of options before them is, or until re- 
cently has been, severely limited in some fundamental respect. Their project is to 
identify and fill one or more conceptual, methodological, social, moral, or politi- 
cal lacunae in the theoretical perspectives available to them and then to work 
out the reconstructive implications of the resulting enlargement of sociologi- 
cal theory —implications that vary in scope and depth from theorist to theorist. 
Proposals for conceptual augmentation have been numerous and diverse, 
spanning the distance from B Turner's program to encompass human corpore- 
ality and overcome sociological theory's neglect of “the most obvious fact of 
human existence, namely that human beings have... bodies" (1984:41), on the 
one side, to steps by Alexander (1988a), Archer (1988), Connor (1996), J Hall 
(1990b), Somers (1995), Swidler (1986), and Wuthnow (1987) to awaken con- 
temporary theory to the structural, symbolic, and institutional dimensions of 
culture and to recent work in anthropology and cultural studies, on the other 
side. Elsewhere, S Turner (1994) and Camic (1986, 1989) have urged renewed 
attention to habitual action— Turner to combat the concept of social practices, 
Camic to defend a sociology of action forms or practices. Others have called 
for the theoretical incorporation of revised conceptions of the self (Seidler 
1994, Wiley 1994, Wolfe 1991) and of emotion and the irrational (Campbell 
1996, Scheff 1997, Sica 1988). (See also Urry 1996 on the "singular absence" 
of the topic of time and space in sociological theory prior to the early 1980s.) 
The pioneering effort along these lines, published slightly earlier but widely 
discussed in the period under review, is Habermas’s concept of communicative 
action. Brought forth to accommodate circumstances where “the actions of the 
agents involved are coordinated not through ... calculations of success”—as 
assumed in theories that define action in instrumental, goal-directed, and related 
terms—"but through acts of reaching understanding [viewed as] a cooperative 
process of interpretation,” communicative action provides Habermas with a 
“comprehensive counter-concept": a means of reconceptualizing rationality (as 
communicative rationality), the organization of the social world (as the relation- 
ship between the communicatively based “lifeworld” and instrumental “system” 
processes), and also the evolution and present condition of society (Habermas 
[1981]1984:101, 285-86, Joas 1991:99; see also Habermas [1981]1989, 
[1983]1990, [1985]1987, [1988]1992, [1992]1996, Calhoun 1992a, Honneth & 
Joas 1991a). More recently, Joas, who has criticized Habermas's formulations, 
has outlined an ambitious alternative to dominant action-theoretical approaches, 
one that seeks to focalize "the creativity of action"— the potentially creative 
dimension of all human activity (Honneth & Joas 1991b:4; Joas 1993, 1996). 
Faulting the tendency of modern theorists to squeeze social action into a di- 
chotomy of rational versus normative action, Joas draws from the early Ameri- 
can pragmatists to show that all action takes the form of an alternation between 
"unreflected habitual action" and the creation of "a new mode of acting, which 
can gradually take root and thus become itself an unreflected routine" 
(1996:129). He uses this reorientation as a foundation for reconstructing theo- 
ries of contemporary culture, collective action, and social change. 

Proposals to fill voids of a more methodological sort have also assumed 
various shapes. Often launched as critiques of “positivist” views of knowledge 
still felt to dominate the theory area, these programs—reviewed in detail else- 
where—have included efforts to predicate sociological theory on recent devel- 
opments in realist and hermeneutic philosophies of science (Outhwaite 1987, 
1996, Sayer 1992, Shapiro & Sica 1984) as well as on some of the diverse epis- 
temological lessons of critical theory, feminist theory, poststructuralism, post- 
modernism, and the sociology of science (Agger 1991, DeVault 1996, Fuller 
1988, J Hall 1990a). More prosaically, work in the theory area, as it was passed 
on by the previous generation of theorists, has been charged with a neglect of 
history, with applying "universal theoretical terms [to] all aspects of social 
life, regardless of times and places" (Skocpol 1984:2-3). In the last two dec- 
ades, this critique has inspired numerous theoretical contributions sensitive to 
the specifics of time and location (for review, see Mandalios 1996), among 
them Mann's (1986, 1993) theory of social power and Sewell's (1992, 1996) 
efforts to reformulate Giddens's concept of social structure and to make 
"events" a viable "theoretical category" (see also Abbott 1995). 

Perhaps the most distinctive current focus, however, has been on an omission 
brought to light by various contemporary social movements. This is the “reality of 
differences" (Lemert 1995): what Calhoun defines as the “basic and urgent proj- 
ect [of] developing ways to take seriously such fundamental categorical differ- 
ences" as gender, race, ethnicity, sexuality, language, class, region, and nation 
(1995:xix-xxii). Among these differences, it is the neglect of gender that has 
received the widest treatment, as feminist scholars have attacked sociological 
theory for its restrictive assumptions about which aspects of social reality merit 
analysis, for its marginalization of feminist contributions, for its masculinist reli- 
ance on "logical dichotomies" and tendency to suppress the agency of subjects 
and "re-attribut[e it] to social phenomena" (Sprague 1997: 93; Smith 1989:49; 
see also Alway 1995, Lengermann & Niebrugge-Brantley 1990, Stacey & 
Thorne 1985, 1996, Wallace 1989). On such grounds, feminist theorists have 
proposed varying degrees of theoretical reconstruction: in some cases, the inclu- 
sion of gender into established theoretical approaches (Chafetz 1988, Walby 
1988); in others, radical alterations of the aims and procedures of all existing 
approaches (e.g. Haraway 1985, Smith 1990, 1993; for review, see Chafetz 
1997, Clough 1994, Lovell 1996). Related lines of argument have emerged else- 
where as well: in the demand for theories of oppression, struggle, motherhood, 
and community constructed from the neglected and “unique standpoint [of] 
Black womanhood, unavailable to other groups" (P Collins 1990: 33); in the cri- 
tique of sociological theory for omitting the concerns and contributions of peoples 
of color in both the Western and post-colonial worlds (Connell 1997, S Hall 
1996, Gilroy 1993); and in the claim that theory has also avoided questions of 
sexual identity, exhibiting both disinterest in gays, lesbians, bisexuals, and 
transsexuals and an unwillingness to rethink its basic terms from the perspec- 
tive of sexual difference (Butler 1993, Clough 1994, Seidman 1996). 

Amid these developments, sociological theory has also been widely in- 
dicted for its exclusion of moral issues, for having strayed from its origins to be- 
come a “detached enterprise of abstract problems [that lack] immediate implica- 
tions for everyday schemes of moral and political action and belief" (Wardell & 
S Turner 1986:11; see also Levine 1995, B Turner 1989, 1996a). An interest in 
ending these exclusions forms a leitmotiv in many of the works already re- 
viewed in this section, some of them explicitly advocating a renewal of “critical 
theory”—in Habermas’s case from the standpoint of communicative rational- 
ity, in Calhoun’s from the perspective of difference. But heightened concern 
with theory as a moral and political enterprise is evident in numerous other 
sources, including Selznick’s (1992) sweeping program to build a “liberal 
communitarianism” on the foundations of pragmatist social theory (cf Bellah 
et al 1985, Etzioni 1996, Horowitz 1993, Wallerstein 1997, Wolfe 1989). 


Project VI: Engagement with Past Theoretical Ideas 


Proponents of the preceding projects differ markedly in their attitude toward a 
sixth line of theoretical activity. This is the analysis of various theoretically 
significant ideas found in the writings of social thinkers from the past—a pro- 
gram that brings together theorists working to advance some of these other 
contemporary projects and scholars who are relatively aloof from such efforts. 

The range of work falling within this category is vast. Recent years have 
witnessed an outpouring of research on and reinterpretation of all the major 
classical theorists, Comte, Spencer, Marx, Durkheim, Weber, Simmel, and 
Mead (for a review, see the chapters in Camic 1997a), along with efforts to revi- 
talize and critically extend their perspectives, as seen, for example, in various 
lines of “neo-Weberian” scholarship (R Collins 1986b, Eisenstadt 1987, 1992, 
1996, Kalberg 1994, Lehman & Roth 1993, Scaff 1989, Schluchter 1989, Sica 
1988, Whimster & Lash 1987). Increased interest has also been shown in the 
ideas of such latter-day figures as Parsons, Mannheim, Merton, and Garfinkel 
(Camic 1991, 1992, Crothers 1987, Heritage 1984, Hilbert 1992, Kettler et al 
1984, Sztompka 1986) and in the work of earlier women thinkers (McDonald 
1994), intellectuals of African descent (Lemert 1993, Seidman 1994a), and 
other “neglected theorists” (Sociological Theory 1994–1995), Norbert Elias 
perhaps chief among them (Fletcher 1997, Mennell 1989). 

These studies cannot be separately reviewed here, but generally they locate 
themselves as projects integral to the theory area (rather than as contributions 
mainly to the history of sociology) on different grounds. These run the gamut 
from the utilitarian claim that works from the past contain concepts and gener- 
alizations “worth salvaging and using" in order to address contemporary re- 
search questions (R Collins 1986b:3) to the *noninstrumental" idea that “con- 
versing with great minds," via “writings of unsurpassed scholarly texture,” is a 
vital experience, “humbling and invigorating” (Poggi 1996:40–42; cf Alexan- 
der 1989). Between the extremes, scholars have also recently urged the careful 
application of hermeneutic methods to past theoretical works; instead of “can- 
nibalizing classical statements in search of testable fragments," such methods 
strive for “a contextual or holistic understanding of [each] theorist's project" 
(Sica 1988:138). In this way, they expand the range of theoretical alternatives 
beyond the bounds of present possibilities and reveal the contingent historical 
processes by which some theoretical approaches were established at the ex- 
pense of others (Camic 1997b; cf Calhoun 1995, S Turner 1996a). 


Project VII: Diagnosis of Contemporary Social Conditions 


Not past theoretical ideas but present social realities form the main focus of a 
seventh contemporary project. This sets for theory the task of providing for the 
late twentieth century what the classical theorists furnished for their era but 
now demands overhaul: an analysis of the nature and dynamics of modernity, 
of the social forces shaping human life in the current period. 

This project is sometimes pursued in close conjunction with others we have 
identified. On the basis of his concept of communicative action, for example, 
Habermas ([1981]1989) examines the modern world (descriptively and nor- 
matively) in terms of the growing penetration, or “colonization,” of the com- 
municative processes of the “lifeworld,” found in areas such as family and 
public discussion, by the instrumental processes of economic, political, and le- 
gal “systems.” In turn, Luhmann draws on systems theory to argue that 
whereas “in modern society, functional systems...have the possibility of self- 
steering,” there are inherent limits on the extent to which the political system 
can effectively regulate other social spheres (1997:45). In Luhmann’s view, 
the contemporary world is unique in the intensity of its calls for such regula- 
tion; analysis of the limits of societal steering thus forms part of “the self-de- 
scription of modern society" (1997:54). For his part Giddens mobilizes struc- 
turation theory to characterize the present age of “late modernity” as the “in- 
creasing interconnection" between two developments: the reflexive organiza- 
tion of "self-identity," as individuals confront distinctive forms of risk amid a 
"transformation of intimacy" and relationships of trust; and “globalization” in 
its economic, political, military, and cultural dimensions (Giddens 1991:1–6, 
see also 1987, 1994a,b, 1996). 

This emphasis on globalization is not unique to Giddens. Albrow's com- 
ment that contemporary sociologists “are falling over themselves in their haste 
to re-thematize old topics in the light of globalizing processes” (1993: 732) 
perhaps reflects the particular perspective of British sociology (see, e.g., 
Featherstone 1990, Robertson 1992). But Albrow's (1997) own recent work 
on globalization, Sklair's (1995) attempt to transcend state-centered views of 
the global system with a theory of "transnational practices," and Meyer et al's 
(1997) analysis of the diffusion of the nation-state model in the context of 
“world society" are lines of thinking that all evidence growing concern with 
the dynamics and consequences of new patterns of interconnection among the 
world's societies, institutions, and peoples. 

Efforts to theorize contemporary conditions have taken several other direc- 
tions as well. For Touraine, “social life is constructed through struggles and 
negotiations around the implementation of...cultural orientations"— struggles 
that, “in today's post-industrial society...center upon... the production and 
mass distribution of representations, data and languages" and thus turn the 
analytical spotlight to contemporary social movements (1995:358, 1988; see 
also Eder 1993, Offe 1985). For Beck, the modern world has witnessed a “ren- 
aissance of political subjectivity” due to the emergence of a “risk society” be- 
set with environmental hazards; here “socially recognized risks...contain a pe- 
culiar political explosive" that ignites activity outside conventional politics 
(1992:24, 1994:18). In other work, theorists have looked to the public sphere 
(Calhoun 1992a), identity politics (Calhoun 1994), the collapse of the Com- 
munist bloc (Offe 1997), the apparent waning of the ideology of liberal re- 
formism (Wallerstein 1995), doubts about the “intelligibility” and “shapeabil- 
ity" of the social world (P Wagner 1994:176), citizenship debates (B Turner 
1993, Laclau 1990), and the nature of contemporary capitalism (Lash & Urry 
1987) and culture (Bauman 1992; see also the new afterword in Bell [1976] 
1996)—all to advance the agenda of coming to intellectual terms with the dis- 
tinctive characteristics of the current historical epoch. 


Project VIII: Dissolution of Sociological Theory 


As theorists have reckoned with the features of the present age, a very different 
project—an anti-project—has emerged in proposals to bring the enterprise of 
sociological theory to an end. Accepting in various degrees the post-modernist 
assault on the activity of social-scientific "theory" and on the reality of the “so- 
cial,” these proposals generally call for the multidisciplinary development of 
critical, non-social-scientistic narratives. 

The challenge to "theory" has been raised by Lemert, who employs post- 
modernist epistemological critiques to argue that "whatever social theory to- 
day is or is not, it has few choices beyond accepting, getting around, or resist- 
ing the stance of radical and continuing doubt about the final vocabulary avail- 
able for use in speaking about the social world" (1992:20). Given this, Lemert 
(1992, 1993, 1994a,b, 1995) holds that sociological theorists must relinquish 
their aspirations to be purveyors of abstract scientific truths, instead becoming 
what Rorty (1989) calls “ironists”—intellectuals who recognize that there ex- 
ists no theoretical metalanguage for objectively arbitrating between competing 
truth claims. Drawing on thinkers like Foucault (1972) and Lyotard (1984), 
Seidman launches a similar attack on sociological theory for its generalizing, 
universalist ambitions, observing that “once the veil of epistemic privilege is 
torn away by post-modernists, [social-scientific theory] appears as a social 
force enmeshed in particular cultural and power struggles" (1994c:124). This 
kind of sociological theory should therefore be replaced by a diversity of "so- 
cial theories" in the form of "broad social narratives [that] relate stories of ori- 
gin and development, tales of crisis, decline, or progress," but that do not pre- 
tend "to discover the one true vocabulary that mirrors the social universe" 
(1994c:120). Moreover, because sociologists hold no monopoly over such nar- 
ratives, theorists need to “move sociology away from its historic role as a disci- 
pline" and open themselves up to insights about the social from feminist the- 
ory, post-colonial studies, queer theory, and so on (Lemert 1994a:268; see also 
Seidman 1994c, B Turner 1990). 

For other contemporary scholars, however, even the "social" itself is deeply 
problematic. This is so, in Game's (1991) view, because the writings of Bar- 
thes, Derrida, and Irigaray have shown that "the social is written, that there is 
no extra-discursive real outside cultural systems" (1991:4). For this reason, all 
forms of sociological theory that assume that theoretical terms are ideational 
representations of reality must give way to a "deconstructive sociology" that 
imagines and analyzes the social only as text, alert to the power interests that 
lie behind representationalist thinking (see also Brown 1990, Clough 1993). 
For Baudrillard (1983), in contrast, it is the present historical period itself that 
marks the "end of the perspective space of the social." As a result of the con- 
temporary diffusion of various technologies of simulation, the activity of rep- 
resentation has itself become increasingly impossible, and the social, as a rep- 
resentation of the real, collapses: "[T]he social only exists in a perspective 
space, it dies in the space of simulation..." (1983:83). Under these conditions, 
sociological theory as a discourse on the social necessarily loses all viability 
(see Kellner 1994, Bogard 1990, Denzin 1986, Gane 1991; for critical discus- 
sion, see Bauman 1992, Seidman & D Wagner 1992, Seidman 1994b). 


DISCUSSION AND CONCLUSION: CONDITIONS OF 
POSSIBILITY 


There are areas in sociology where the presence of this number of active and 
diverse projects would generally be seen as evidence of intellectual vitality. 
Among the majority in the theory area, however, this is not the prevailing sen- 
timent. Pessimists, disillusioned over what they take to be the direction of the 
area, here greatly outnumber those who are now sanguine about the field or 
some particular contemporary project. 

Expressions of this attitude are widespread, though efforts to get to its roots 
have been meager. Proponents of the theoretical projects described above of- 
ten portray themselves in dismay, facing a hard battle on behalf of their cause. 
Rarely, however, do theorists provide a serious and constructive assessment of 
the forces arrayed against their particular agenda. To the contrary, most con- 
temporary theory programs put themselves forth on exclusively intellectual 
grounds, giving minimal sociological attention to the organizational condi- 
tions that confront them—conditions these programs must reckon with realis- 
tically if they are to advance their aims. Four important conditions of possibil- 
ity are discussed below. 


OTHER PROJECTS With isolated exceptions, few contemporary projects 
evince awareness of the range of other projects with which they share the the- 
ory field. This situation precludes systematic consideration in the field of pos- 
sible relationships among projects: of the different intellectual questions to 
which the various projects may be differentially applicable; of the points at 
which different programs might operate in collaborative, complementary, in- 
tegrable, or cross-cutting modes, in contrast to points at which they are com- 
petitive or mutually irrelevant (on these options, see Levine 1986). Added to 
this problem is the tendency among several projects simply to lump the major- 
ity of alternative projects together in an undifferentiated mass. It is thus that 
advocates of tool-construction (Project I) are prone to regard all other theoreti- 
cal efforts as little more than the exegesis of past ideas (Project VI), overlook- 
ing in the process even a number of differently formulated variants of their 
own program—an oversight exhibited in other projects as well. By this kind of 
lumping, various projects misassess their relative position in the theory area: 
tool-constructors, for example, regularly, but wrongly, presenting their agenda 
as a beleaguered minority stance; synthesizers (Project II) erring in the opposite 
direction by absorbing all other lines of theoretical work into their own pro- 
gram, thereby fostering the widely accepted but inaccurate view that this one 
project is contemporary sociological theory as such. 


SOCIOLOGY AT LARGE Quantitative data on the organization of sociology 
document that, during the very period when these contemporary theory proj- 
ects have been under way, not only did "new theoretical approaches [fail to] 
provide a unified perspective for the discipline as a whole," but contributions 
from the theory area ceased “to be closely connected to the sociology litera- 
ture" at large (Crane & Small 1992:229–30). Historical evidence suggests that 
this is no recent development: that the impact of sociological theory on socio- 
logical research has long been tenuous, with theory an unwanted presence in 
many of the ever-changing speciality areas of the discipline (Sica 1989a,b). 
With occasional exceptions, however, theorists have been slow to think 
through the implications of this and to fashion suitable roles for themselves un- 
der these conditions. To the contrary, current theoretical projects often read 
like efforts to wish the conditions away, to deny that the strong "internal con- 
nectivity" between research areas that gives centrality to "theory" in a disci- 
pline like economics is simply not an institutional feature of sociology (Crane 
& Small 1992:208, 231). 


EXTERNAL FORCES As a result of its own "fragile professional standing," the 
entire discipline of sociology has been buffeted during this same recent period 
by major changes in its social and institutional links to other academic disci- 
plines, in its relations to its different patrons and publics, and in the size and 
composition of its student clienteles (Halliday 1992). Save for scattered com- 
mentary on the first of these points, however, theorists have rarely considered 
the bearing of these developments on the viability of their own projects. And 
even regarding the first point, their opinions are divided, with some theorists 
anticipating only minor adjustments in the disciplinary status quo and others 
expecting and calling for closer ties between sociological theory and every- 
thing from biology to all the social sciences as well as history and cultural stud- 
ies. For those urging these ties, the common hope is that they will produce a 
“flourishing” back-and-forward exchange between these other fields and work 
in sociological theory (Calhoun 1992b); but theorists need also to recognize 
that reciprocated exchanges have been the exception—that theoretical projects 
inside sociology have drawn from without far more often than they have been 
elsewhere drawn upon (Crane & Small 1992:231–32). 


HISTORICAL LEGACIES Almost invariably, contemporary theoretical projects 
rest on historical narratives, accounts of the theory field (and sometimes of 
sociology and forces external to it) at previous times—the 1940s and 1950s, the 
1950s and 1960s, the 1960s and 1970s—different theorists choose different 
slices. But their historical narratives are ordinarily more than window dressing: for 
each narrative is the baseline for the particular project that the theorist seeks to put 
on the agenda for the future, an analysis of what the theory field thus far offers 
and does not offer, an inventory of its resources and lacunae, of its problems 
and possibilities. Take the historical narrative away and one removes much of 
the rationale for the project it accompanies. Yet, for all this, few theorists ex- 
hibit much historical care in constructing their narratives, in defining the intel- 
lectual and institutional legacies that the contemporary theory field actually 
confronts. Indeed, so lax are theorists in this respect that their historical claims 
tend not only to cancel one another out, but also to meet broad disconfirmation 
when subjected to direct historical investigation (see Platt 1996). 

In the course of advancing their different projects, a few contemporary 
theorists have given serious attention to some of these organizational factors 
(Bourdieu & Wacquant 1992, Calhoun 1992b, R Collins 1986a, 1989, Levine 
1995, Sica 1989b, S Turner & J Turner 1990). But systematic examination of 
the intellectual and institutional relations among the diverse projects that now 
constitute the theory field, of the field’s complex historical legacies, and of its 
current position within sociology and in regard to various external forces has 
yet to crystallize into a program in its own right, a project that would problema- 
tize the form or forms appropriate to sociological theory under present condi- 
tions of possibility. If at least some current projects are to succeed, however, 
contemporary developments would seem to warrant also including this project 
square on the theorist's agenda. 


ABSTRACT 


Computer-assisted data analysis is usually associated with the analysis of ag- 
gregate data according to the tenets of logical positivism. But there are more 
than twenty computer programs designed to assist researchers analyzing eth- 
nographic data, and these programs may be used by researchers with a vari- 
ety of epistemological orientations. Some computer-assisted qualitative data 
analysis (CAQDA) programs automate analysis procedures that have been 
used by generations of ethnographers. Others open up new directions 
through the use of linked coding schemes, hypertext, and case-based hy- 
pothesis testing. Ethnographers interested in computer assistance must ac- 
quaint themselves with the variety of capabilities and programs available be- 
cause no one program dominates the CAQDA field. In this article, we pro- 
vide an overview of the theoretical and practical considerations bearing on 
the choice of CAQDA software. 


INTRODUCTION 


Computer-assisted analysis in sociology is currently associated with the pro- 
fession's constructed category of "quantitative research" rather than its con- 
structed category of “qualitative research.”¹ Statistical procedures available in 
mainstream software packages such as SAS and SPSS facilitate the analysis of 
aggregate data, and most sociologists using these data have adopted an orienta- 
tion of logical positivism. Thus, computer-assisted analysis carries connota- 
tions of hard data, computation, and objectivity. On the other hand, sociolo- 
gists associated with qualitative research have generally held that aggregate 
data analysis using statistical procedures either misses important sociological 
causes of social action or emphasizes explanation (the hallmark of logical 
positivism) at the expense of understanding. The general overstatement of the 
differences between quantitative and qualitative research has meant that quali- 
tative researchers have shown a slowness, if not reluctance, to use computer 
assistance in data analysis. 

Just as it can with aggregate data, computer assistance can facilitate system- 
atic computational research with qualitative data. In addition, CAQDA (com- 
puter-assisted qualitative data analysis) technologies can be useful for re- 
searchers who place themselves outside the positivistic research tradition. For 
example, within participant-observation research, there are three epistemo- 
logical traditions: positivism, symbolic-interactionism, and ethnomethodology. 
While symbolic-interactionism and ethnomethodology appear antithetical to the 
use of computer-assisted analysis, a close look at the capabilities of current 
CAQDA software suggests these packages could be useful for research in 
these traditions and could become even more useful in the near future. 

Software for the analysis of qualitative data has appeared relatively re- 
cently, and although qualitative sociologists have been slow to adopt this 
software, at present there are more than twenty packages available. No one 
package dominates, however, so the qualitative analyst interested in computer 
assistance must decide which package to use. The amount and kind of ethno- 
graphic data on hand or to be collected, the purpose of the research, the epis- 
temological framework of the researcher, and the goodness of fit between the 
research project and the capabilities of available software all figure into this 
decision. In the end, the analyst may decide that no CAQDA software is called 
for. A review of ethnographies in journals and books published in the last five 
years suggests that many ethnographers either do not use any CAQDA soft- 
ware or consider their use of it so unobtrusive that they fail to mention it at all 
in their ethnographic reports.² 

1 The constructed categories of “quantitative” and “qualitative” research have led sociologists 
to misunderstand the fact that real differences in research method are due to adherence to different 
epistemologies and not to the use of quantitative or qualitative data. For ease of reading, we drop the 
use of quotation marks, but the constructed nature of the categories quantitative research and 
qualitative research should be borne in mind. 

In this article, we discuss the capabilities and limitations of CAQDA in gen- 
eral, the factors that distinguish one CAQDA software package from another, 
and specific qualities of a number of software packages. We intend this discus- 
sion to serve as a theoretical and practical introduction to CAQDA use. In the 
article's conclusion, we return to the methodological issues broached above to 
consider if the gap between quantitative and qualitative research might be 
bridged by current and future CAQDA packages. We do not endorse any par- 
ticular analytical strategy, nor do we recommend or review any particular soft- 
ware package in this article.³ 


QUALITATIVE DATA ANALYSIS AND THE 
PERSONAL COMPUTER 


From QDA to CAQDA 


Ethnographers have been using computers for decades. Interviews and field- 
notes have been transcribed into word processors (Kirk 1981), and many eth- 
nographers now carry portable lap-top computers into the field. The use of 
computers for the analysis (rather than the gathering) of ethnographic data is a 
more recent development. 

Many CAQDA software packages facilitate data analysis from the 
grounded-theory perspective codified by Glaser & Strauss (Lonkila 1995). 
Grounded theorists advocate close contact with raw data, the emergence of 
analytical categories from the data through memo writing, and comparison as 
the primary analytical tool (Glaser & Strauss 1967). Elements of grounded the- 
ory are common in CAQDA in part because Glaser & Strauss are explicit about 
the principles and procedures involved in this kind of analysis (see especially 
Strauss 1987). 

Several general approaches to qualitative data analysis (QDA) incorporate 
elements of grounded theory or are consistent with that perspective. For exam- 
ple, while Pfaffenberger does not explicitly embrace the grounded-theory 
method of analysis, his three fundamentals for the analysis of qualitative data 
(rewriting, coding, and comparison) are consistent with grounded-theory 
methods (Pfaffenberger 1988:26–30). Huberman & Miles propose a three-part 
conceptualization of the analysis process: data reduction, data display, and 
conclusion drawing/verification. As in grounded-theory approaches, data re- 
duction, display, and conclusion drawing are causally and temporally inter- 
twined (Huberman & Miles 1994:429). Huberman & Miles expand these ab- 
stract procedures into a concrete set of thirteen "tactics" for undertaking quali- 
tative analysis (Huberman & Miles 1994:432). These range from noting pat- 
terns and themes, clustering, and counting to making contrasts and compari- 
sons, shuttling between data and categories, building a logical chain of evi- 
dence, and making conceptual/theoretical coherence. Tesch distills ten "prin- 
ciples and practices" in the analysis of many types of qualitative data. While 
the principles (data analysis is concurrent with collection, analysis is not rigid, 
and the result of analysis is a higher-level synthesis) outnumber the practices 
(data are segmented, data are categorized, the main intellectual tool is com- 
parison), Tesch's approach follows the general thrust of grounded-theory 
analysis (Tesch 1990:95–97). 

2 With few exceptions, ethnographic works published in Qualitative Sociology, Journal of 
Contemporary Ethnography, and Symbolic Interaction as well as in the American Sociological 
Review, the American Journal of Sociology, Social Forces, and Social Problems make no mention 
of the use of CAQDA. Book-length ethnographies in our areas of expertise (gangs, poverty, urban 
and community studies) reviewed in the journals listed above (as well as in Contemporary 
Sociology), and those discussed in three recent overviews of qualitative research (Charmaz & 
Olesen 1997, Horowitz 1997, Morrill & Fine 1997) also rarely mention CAQDA. 

3 By way of full disclosure, Sánchez-Jankowski is now using the askSam package, and Dohan is 
using the Folio Views package. 

QDA that is consistent with grounded theory uses a sequential style of 
analysis that is highly data-intensive. Advocates of these methods urge the 
analyst to begin data analysis while collection is under way, to reduce the data 
using codes or categories, to shuttle between data and codes, and to compare 
coded and raw data to make tentative and ultimate conclusions. This analytical 
strategy returns the analyst to the database over and over again, and each step 
of analysis is readily translated into computer modules and procedures. Be- 
cause grounded-theory and similar analytical strategies are consistent with 
logical positivism, they present practical challenges to computer programs but 
few epistemological challenges. 

Researchers relying on context-dependent methods of analysis such as the 
extended-case method (Burawoy 1998, Burawoy et al 1991), symbolic-inter- 
actionism (Blumer 1969:Ch. 1), and ethnomethodology (Garfinkel 1967) may 
also find software designed around the grounded-theory principles helpful. 
But they are less likely to be able to take advantage of all the built-in features of 
these packages. The analytical principles of these context-dependent methods 
are more difficult to codify than those of grounded theory. So, while grounded 
theorists may find themselves able to take advantage of a wide variety of com- 
puter resources as they move from QDA to CAQDA, ethnographers working 
in other traditions may find that computer assistance limits their analyses un- 
less they limit the extent to which they make use of computers. 


First Steps and Basic Capabilities of CAQDA 


Fortunately for ethnographers working outside of the grounded-theory tradi- 
tion, computer assistance is not an all-or-nothing affair. Some features of con- 
temporary CAQDA may be used and others may be ignored. We organize our 
discussion of the practicalities of CAQDA analysis around seven tasks per- 
formed by the user of CAQDA software (Weitzman & Miles 1995:Ch. 3). 
Analysts, whether positivists or not, may find some of the tasks required by 
CAQDA to be theoretically or practically onerous. But different software 
packages require different tasks, so analysts can pick and choose software that 
facilitates the tasks they are interested in without requiring those they find ob- 
jectionable. 


ENTERING DATA Seemingly mundane decisions made early in the ethno- 
graphic project may have significant practical, methodological, and theoretical 
consequences. How to enter data into the computer is one such decision. 
Which data are entered into the com- 
puter, how they are entered, and which remain outside the computer shape all 
further analyses of the data. Fischer notes that data can enter a computer in a 
myriad of forms, from the “beginning” methods of text processing on a word 
processor to “advanced” methods of digital signal processing of videotape 
(Fischer 1994:15–21). For the moment, we confine ourselves to issues related 
to the entry of text; we address audio, visual, and graphic forms of data in later 
sections of this article. A primary consideration for researchers entering text 
data into the personal computer is the size of the textual unit of analysis. Notes 
entered into a dedicated CAQDA package are divided into analysis 
"chunks"—which can be single words, lines of text, paragraphs, hypertext 
note cards, or larger files. Especially important is the size of chunks—the indi- 
visible units that are de-contextualized and re-contextualized during the analy- 
sis process (Tesch 1991b). For example, larger chunks of text are more likely 
to contain data falling into several analytical categories, and this may compli- 
cate positive correlational analysis. But for analysts interested in context- 
dependence, smaller chunks may prove worthless unless the CAQDA software 
contains elaborate coding or linking procedures. 
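
The effect of chunk size on later analysis can be sketched in a few lines of Python. This is only an illustration under our own assumptions: the function name `chunk_text`, the sample notes, and the three unit sizes are hypothetical and do not reproduce the behavior of any particular CAQDA package.

```python
import re

def chunk_text(text, unit="paragraph"):
    """Split raw fieldnotes into analysis chunks of the requested size."""
    if unit == "paragraph":
        parts = re.split(r"\n\s*\n", text)   # blank lines separate paragraphs
    elif unit == "line":
        parts = text.splitlines()
    elif unit == "word":
        parts = re.findall(r"\w+", text)
    else:
        raise ValueError(f"unknown unit: {unit}")
    # Drop empty pieces so every chunk carries some text.
    return [p.strip() for p in parts if p.strip()]

# Hypothetical fieldnote excerpt, invented for this sketch.
notes = (
    "Met R. at the corner store.\n"
    "He talked about rent and about his job search.\n"
    "\n"
    "Later the block association met in the church hall."
)

paragraphs = chunk_text(notes, "paragraph")
lines = chunk_text(notes, "line")
print(len(paragraphs), "paragraph chunks;", len(lines), "line chunks")
```

In this sketch the first paragraph chunk mentions both rent and the job search, so it would fall under two analytical categories at once, whereas the line chunks separate the encounter from what was said and so lose some of the context a paragraph preserves.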

Practical issues also arise at the data-entry stage. Should the qualitative da- 
tabase include expanded and annotated fieldnotes, interview transcripts, and 
memos? Memos can be electronically linked to existing data via hypertext 
connections or in situ “pop-up” notes. What are the advantages of integrating 
memos into the qualitative database? Are the advantages of an all-inclusive da- 
tabase worth the costs of greater storage overhead and slower processing 
times? Removing memos from the CAQDA software environment may ham- 
per the goal of comprehensive analyses of ethnographic data, but it may bolster 
a sociological imagination that extends beyond the parameters of a particular 
software package. 


ORGANIZING DATA Cases and variables organize quantitative datasets. The 
organization of ethnographic data varies depending on the research project at 
hand. The number of ethnographers involved in the project, the number of 
field sites, the variety of data types, and the theoretical orientation of the re- 
searchers all influence how the ethnographic dataset is organized. Researchers 
should at least familiarize themselves with basic database-management princi- 
ples (Tesch 1990:199–210; Winer & Carrière 1991) to be sure that early deci- 
sions about the structure of the qualitative database do not create insurmount- 
able data-management problems later in the project. 


SEARCHING FOR AND RETRIEVING DATA Computers increase the ethno- 
grapher's ability to search for and retrieve text. For some ethnographers, search 
and retrieval represents the end of the computer's usefulness as a qualitative 
data analyst assistant, and several CAQDA packages are designed for this kind 
of analysis. At the least, searching for and retrieving data involves the ability to 
find and display a string of text characters that has been entered into the database 
(Tesch 1990:181–94). CAQDA software usually allows ethnographers to search 
for root forms of words or synonyms, to use wildcard characters, and to mount 
combination searches such as those based on word proximity or word order. 
Boolean-defined searches for multiple items round out the menu for searches. 
Retrieval of searched-for items is governed, again, by the size of text chunks and 
the flexibility of the package in retrieving consecutive or proximate chunks. 
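
The kinds of search described above (word stems or wildcards, proximity, and Boolean combinations) might be sketched as follows. The helper names `has_stem` and `near`, the sample chunks, and the token-window definition of proximity are our assumptions for illustration, not the search syntax of any actual package.

```python
import re

# Hypothetical text chunks, invented for this sketch.
chunks = [
    "The landlord raised the rent again this spring.",
    "Renters on the block organized a meeting about housing.",
    "Police stopped two young men near the park.",
]

def tokens(chunk):
    """Lower-cased word tokens of a chunk."""
    return re.findall(r"[a-z']+", chunk.lower())

def has_stem(chunk, stem):
    """Wildcard-style match: any word beginning with `stem` (like 'rent*')."""
    return any(w.startswith(stem) for w in tokens(chunk))

def near(chunk, a, b, window=5):
    """Proximity match: words starting with a and b within `window` tokens."""
    words = tokens(chunk)
    pos_a = [i for i, w in enumerate(words) if w.startswith(a)]
    pos_b = [i for i, w in enumerate(words) if w.startswith(b)]
    return any(abs(i - j) <= window for i in pos_a for j in pos_b)

# Stem search, a Boolean AND NOT search, and a proximity search.
rent_hits = [i for i, c in enumerate(chunks) if has_stem(c, "rent")]
rent_not_landlord = [
    i for i, c in enumerate(chunks)
    if has_stem(c, "rent") and not has_stem(c, "landlord")
]
near_hits = [i for i, c in enumerate(chunks) if near(c, "landlord", "rent", window=3)]
print(rent_hits, rent_not_landlord, near_hits)
```

Because retrieval returns whole chunks, the same search would return more or less surrounding text depending on how the notes were segmented at data entry.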


CODING DATA Coding, also referred to as indexing (Richards & Richards 
1991a, 1994:457) or content analysis (Berg 1995:Ch. 9), is a central feature of 
much CAQDA. The use of the computer need not affect the fundamentals of 
data coding. Weaver & Atkinson coded their illustrative fieldnote material 
manually before entering the data into their CAQDA package (1994:52–53). 
Most discussions of computer-assisted coding reinforce what Weaver & At- 
kinson learned through practical experience: The hard work in coding data is 
intellectual, not mechanical. Computer assistance does not relieve the eth- 
nographer of the need to spend many hours devising, revising, and applying an 
indexing system that is reliable and valid (a general approach is Werner & 
Schoepfle 1987). Moreover, computer assistance can impose limitations or re- 
strictions on the coding process that can create problems for ethnographers 
(Weaver & Atkinson 1994:38–42). Coding should be driven by the theoretical 
orientations that inspired the original research. Analysts must be confident that 
using the computer facilitates their work. They must remain alert to the possi- 
bility that coding data with a well-designed computer program can become an 
end unto itself; highlighting sections of text with combinations of on-screen 
colors or sorting and re-sorting half-coded notes can easily create the comfort- 
ing appearance of progress. 


ANALYZING CODES Analysis of codes begins as soon as the first data are 
coded. Codes are defined in relationship to each other, so their application to a 
set of data implies theory. CAQDA software can make this implicit theory ex- 
plicit by generating a list or map of codes and their relationships. Some pack- 
ages constrain the development of a coding scheme to encourage the analyst to 
make positive connections between codes, such as hierarchical connections 
between more and less inclusive ones (Richards & Richards 1995) or sequen- 
tial connections between coded events (Corsaro & Heise 1990). Analyzing 
codes is thus simultaneous with the coding process. 

Once sufficient data have been coded, other analytical possibilities de- 
velop. In most CAQDA packages, analysts search for codes as easily as they 
explore raw data. Boolean capabilities are useful here, particularly for analysts 
interested in computation, because they allow the ethnographer to count in- 
stances of codes or conjunctions of codes. Alternatively, packages that retrieve 
text associated with particular codes or conjunctions of codes may be useful for 
analysts interested in interpretational analysis. 
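
The two uses of Boolean code searches distinguished above, counting conjunctions of codes and retrieving the text attached to them, can be sketched together. The code labels, sample passages, and function names (`with_all`, `with_any`) are hypothetical and chosen only for illustration.

```python
# Hypothetical coded chunks: each passage carries a set of analyst-assigned codes.
coded_chunks = [
    {"text": "He lost the warehouse job in March.", "codes": {"work", "loss"}},
    {"text": "His cousin found him day labor.", "codes": {"work", "kin"}},
    {"text": "Her aunt watches the kids after school.", "codes": {"kin", "childcare"}},
    {"text": "Since the layoff he leans on his brother.", "codes": {"work", "loss", "kin"}},
]

def with_all(chunks, *codes):
    """Boolean AND: chunks tagged with every listed code."""
    wanted = set(codes)
    return [c for c in chunks if wanted <= c["codes"]]

def with_any(chunks, *codes):
    """Boolean OR: chunks tagged with at least one listed code."""
    wanted = set(codes)
    return [c for c in chunks if wanted & c["codes"]]

work_and_kin = with_all(coded_chunks, "work", "kin")

# Computational use: how often do the two codes co-occur?
count = len(work_and_kin)

# Interpretive use: read the passages where they co-occur.
passages = [c["text"] for c in work_and_kin]

print(count)
for p in passages:
    print("-", p)
```

The same query serves both styles of analysis; what differs is whether the analyst keeps the number or returns to the text.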

Aside from data entry, the analysis of codes is the area of the computer's 
greatest influence on theory and methods. Software design may force the ana- 
lyst to consider the previously unexamined relationship between concepts in 
the research project. The flip side of the coin is that software may limit the abil- 
ity of the analyst to develop theory in desired directions. The ability to mount 
comprehensive searches for codes and sets of codes means that ethnographic 
analysis may benefit from less bias. But large-scale searches can also bury the 
analyst in chaotic results. In short, the computer-assisted analysis of codes has 
theoretical and methodological implications surpassed only by those taken 
during the first steps of data entry. 


LINKING DATA The most recent development in the analysis of qualitative data 
requires computer assistance (Coffey & Atkinson 1996:181–87). Software 
available in the last decade allows analysts to create hypertext links between 
combinations of data, codes, memos, and research reports. Graphics, sound, 
and video may also be incorporated into "hyperspace" databases (Weaver & 
Atkinson 1994:Ch. 5). Analysis based on data linking may prove a boon for 
ethnographers who collect non-textual data, especially if hypertext moves out 
of the researcher's office and becomes a medium for the distribution of re- 
search reports. Even for ethnographers who rely exclusively on text, the meta- 
phor and activity of creating links in the ethnographic database have potential 
for generating innovative results. For researchers working outside of the posi- 
tivistic tradition, linking data may be particularly valuable. Hyperlinks concre- 
tize nonlinear data-analysis techniques and free the researcher from reliance 
on computation. Reports that incorporate graphics, sound, and video can more 
readily make the case for the significance of context. 

But hypertext technology also imposes special limitations on analysts. At 
present, the incorporation of text into hypertext "spaces" is inevitably fraught 
with more burdensome formatting limitations than those imposed by tradi- 
tional text databases. Integrating sound or video into an ethnographic database 
involves technological expertise beyond the use of the word processor. In ad- 
dition, the publication of materials using sound or video technology may intro- 
duce new ethical considerations, such as the protection of research subjects' 
confidentiality. 


ANALYZING LINKS Analyzing links within the database is a more general 
form of analyzing codes. As in the analysis of codes, links may be analyzed 
only after a certain number have been established in the data. Once estab- 
lished, the links may be abstracted from the original data and analyzed as a sys- 
tem or network of their own. Compared to the analysis of codes, the analysis of 
links is more flexible and general. Greater complexity is possible in hypertext 
links than in coding schemes, so the representation of linked data may conse- 
quently be more complicated. At the same time, the ability to grasp at a glance 
a properly abstracted set of links allows analysts to bring "right brain" analysis 
to ethnographic analysis even when coding and linking have produced a com- 
plex data structure (Agar 1991). Similar to linking technologies, the com- 
puter's ability to analyze links may be especially appreciated by those working 
in symbolic-interactionist, ethnomethodological, and other nonpositivist tradi- 
tions. The challenges and drawbacks of linking data, codes, and memos apply 
equally to their analysis. 
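
One way to picture links abstracted from the data and analyzed as a network of their own is to store them as a simple undirected graph. The item names below (`note_12`, `memo_3`, and so on) and the two measures shown are hypothetical; this is a sketch of the general idea rather than the method of any hypertext package.

```python
from collections import defaultdict, deque

# Hypothetical links among fieldnotes, memos, codes, and a photograph.
links = [
    ("note_12", "memo_3"),
    ("memo_3", "code_trust"),
    ("note_27", "code_trust"),
    ("photo_4", "note_27"),
    ("note_40", "memo_9"),
]

# Abstract the links from the data: an undirected adjacency structure.
graph = defaultdict(set)
for a, b in links:
    graph[a].add(b)
    graph[b].add(a)

def degree(node):
    """Number of items directly linked to `node`."""
    return len(graph[node])

def reachable(start):
    """All items connected to `start` by any chain of links (breadth-first)."""
    seen = {start}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in graph[node]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen - {start}

print(degree("code_trust"))
print(sorted(reachable("note_12")))
```

Here the cluster around `code_trust` is disconnected from `note_40` and `memo_9`, the kind of pattern that is easier to grasp at a glance from an abstracted set of links than from the notes themselves.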


Summary 


Computer-assisted qualitative data analysis does not differ fundamentally, for 
the most part, from the nonmechanical qualitative analysis traditions from 
which it has developed. Most computers ease the labor burden and broaden the 
scope of common analysis tasks such as typing up field data and memos, 
searching for text, coding data, and sorting and comparing codes. Hypermedia 
is a unique contribution of computer technology to the analysis of qualitative 
data. Linking text, analysis, and non-text materials (graphics, sound, and 
video) in a single analytical space outside of the mind's eye is not possible 
manually. 

Computer assistance is not free—theoretically or methodologically. The 
design of most CAQDA software after the metaphors and practices of 
grounded-theory analysis means that ethnographers who are working outside 
of that tradition may have to coax a recalcitrant software package into aiding 
their preferred style of analysis. Naturally, the less assistance the ethnographer 
requires from the computer, the less intrusive the grounded-theory perspective 
is likely to be. The computer also makes demands on the form of ethnographic 
data collected. At present, the computer still favors word-processed text over 
other forms of data such as sketches, maps, photographs, video images, or re- 
corded sound. But as computers increase in power, analysts can look forward 
to gaining greater digital control over non-text data. The experience of flipping 
through pages of fieldnotes—sketches, diagrams and coffee stains—will 
never be replicated on the computer monitor. But if knotty problems such as 
protecting the identity of research subjects can be overcome, computers may 
soon provide a compelling auditory and visual alternative to this tactile experi- 
ence. 


CAQDA SOFTWARE 


Computers can be programmed to accomplish four different kinds of analysis: 
numerical/arithmetic analysis, writing and document processing, data organi- 
zation, and symbolic manipulation (Fielding & Lee 1991:2–3). Ethnographers 
use computers for all these kinds of analysis. Our overview of contemporary 
CAQDA software is organized around major distinctions in how data are or- 
ganized and how symbols are manipulated by different packages. This over- 
view is not meant as a thorough guide for the prospective purchaser of 
CAQDA software. That reader should read reviews of programs (Prein et al 
1995; Tesch 1990; Tesch 1991a, especially volume 2; Weitzman & Miles 
1995), consult published discussions of researchers' experiences with 
CAQDA software (cited passim below), and try out different software with his 
or her own data. In preparing this overview, we have drawn especially on Com- 
puter Programs for Qualitative Data Analysis (Weitzman & Miles 1995), 
which contained the most thorough and up-to-date reviews available at press 
time. 


Document Processing: Searching and Retrieving 


Word processing is the bread and butter of computer assistance for the eth- 
nographer. The only computer assistance many ethnographers require is 
searching with a word processor. Basic searches retrieve a text string from a 
single computer file. More advanced searches count the occurrences of a 
string, and stand-alone search engines can search multiple files and produce 
extracts of search "hits" in context. The General Inquirer, the first CAQDA 
package, produced lists of word counts from a selected file as a preface to con- 
tent analysis (Stone et al 1966). This ability is no longer considered the prov- 
ince of CAQDA packages, and for many ethnographers, text searching within 
a word-processing file is sufficient (Stanley & Temple 1995). Specialized pro- 
grams developed for both CAQDA and commercial uses enhance the search 
and retrieval process. Many of these programs are designed for what Tesch 
called descriptive-interpretive work rather than theory building (Tesch 1990, 
1991a). For searching and retrieving, packages including GOFER, Meta- 
morph, Orbis, Sonar Professional, The Text Collector, WordCruncher, ZyIN- 
DEX, and FYI3000PLUS expand on the capacities of word processors in sev- 
eral ways (for FYI3000PLUS, see Weaver & Atkinson 1994). 
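
Word counts and multi-file extracts of hits in context, the two capabilities mentioned above, can be sketched briefly. The file names, sample sentences, and the `kwic` helper are our own hypothetical constructions; the sketch does not reproduce the General Inquirer or any commercial search engine.

```python
import re
from collections import Counter

# Hypothetical "files" held in memory so the sketch is self-contained.
files = {
    "fieldnotes_01.txt": "The store owner said the block had changed. The block felt quieter.",
    "interview_02.txt": "She grew up on this block and never left.",
}

def word_counts(text):
    """Frequency list of lower-cased words in one file."""
    return Counter(re.findall(r"[a-z']+", text.lower()))

def kwic(files, term, width=20):
    """Keyword in context: each hit with `width` characters on either side."""
    pattern = re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE)
    hits = []
    for name, text in files.items():
        for m in pattern.finditer(text):
            left = text[max(0, m.start() - width):m.start()]
            right = text[m.end():m.end() + width]
            hits.append((name, left + "[" + m.group() + "]" + right))
    return hits

counts = word_counts(files["fieldnotes_01.txt"])
hits = kwic(files, "block")

print(counts.most_common(3))
for name, context in hits:
    print(name, "|", context)
```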

First, these packages create and manage the ethnographic database. Some 
of these packages manage files off-line (data remain in separate, unaltered text 
files); others manipulate the data directly. Usually, document processors work 
on documents that have already been produced in a word-processing package. 
Orbis manages files produced in XyWrite or NotaBene; Metamorph and 
WordCruncher are particularly adept with WordPerfect documents. Others 
read files produced by a variety of word-processing, database, spreadsheet, 
and even drawing programs. Nearly all can manage plain text files, and some 
packages require files to be in this format before they can work with them. 

The second value-added feature of document processors is their search fea- 
tures. As part of their management of the qualitative dataset, document proces- 
sors allow the analyst to specify a variety of computer files in which to conduct 
a single search. ZyINDEX, for example, searches documents that remain in their 
native format off-line, allows the analyst to keep track of changes to docu- 
ments through several revisions, and indexes files so they can be readily in- 
cluded or excluded from particular searches. Document processors can mount 
complex searches: combinations or sequences of text strings; strings within 
specified proximity of each other; word synonyms, stems, and roots; and 
searches defined through Boolean, fuzzy, or set logic. Some display the results 
of searches interactively so that analysts can see how the addition or deletion 
of certain search terms in a complex search affects the number of hits pro- 
duced. 

Document processors are designed to make it easy for ethnographers to in- 
vestigate data they have collected. Compared to word processors, document 
processors do a better job of placing the complete ethnographic dataset in the 
hands of the analyst. They allow the ethnographer to search more easily for de- 
sired pieces of text and to investigate how the text is arranged in the dataset. 
But document processors place some limitations on the format of data, espe- 
cially on the use of non-text data such as drawings, figures, or other freehand 
notes. Although searching and retrieving text from an ethnographic database is 
a relatively non-invasive way of using CAQDA software, ethnographers must 
not be lulled into a false sense of security. CAQDA software betters the odds of 
finding significant material in the ethnographic database, but it does not assure 
it. 


Data Organization 


Searching and retrieving allows the analyst to inspect but not alter the ethno- 
graphic database. However, CAQDA packages such as askSam, Folio Views, 
MAX, Tabletop, HyperQual2, Kwalitan, Martin, QUALPRO, and The Eth- 
nograph allow the analyst to alter the form of the ethnographic database by or- 
ganizing its text. Data organization is one of the dominant forms of contem- 
porary CAQDA, and the packages listed here include some widely discussed 
in the literature (see Armstrong 1995; Mangabeira 1995; Smith & Hesse-Biber 
1996; Sprokkereef et al 1995; Weaver & Atkinson 1994, 1995). 

Organizers expand on document processors in two ways. First, organizers 
allow the ethnographer to attach a structure to the ethnographic database. 
Some document processors can retrieve text chunks in context. Organizers 
create context by giving analysts control over the structure of the ethnographic 
database, and this structure can be manipulated and analyzed by the researcher. 
Organizers can also structure the ethnographic database by adding database 
fields for factual information and for memos that are produced during analysis. 
The second addition of organizers is the ability to code ethnographic data ac- 
cording to a theoretical scheme developed by the analyst. Organizers are de- 
signed to tag chunks of text with analytical codes and to retrieve codes and 
tagged text. Retrieval of codes frequently includes the ability to search for 
multiple codes, to retrieve the text associated with codes, or to count codes. 


ORGANIZING AND ANNOTATING Organizing and annotating are two basic 
tasks of qualitative data analysis. Some computer applications are designed to 
translate these activities with fidelity from hard copy to electronic form. For 
example, HyperQual2 and Martin use note cards as an organizing metaphor. 
Like their hard-copy counterpart, the note cards of these CAQDA packages 
each contain a single chunk of text. Electronic cards can be replicated and 
sorted into stacks, and these stacks then provide the raw materials to write up 
memos, annotations, and the ethnographic report. Another way to organize a 
hard-copy database is to use database-like fields. Fields can contain a variety 
of information including factual information that situates the ethnographic text 
to which it is attached (data collector, date of interview or observation, infor- 
mation about the subject of the note) or analytical information about the text it- 
self. CAQDA software such as askSam facilitates the creation, insertion, and 
organization of these fields. Once the data are organized, these programs can 
quickly search and retrieve information from database fields and quickly count 
and tabulate the results of these searches. 
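
Database-like fields attached to each note can be sketched as simple records. The field names (`collector`, `date`, `site`, `kind`), the sample records, and the helpers `select` and `tabulate` are hypothetical; they illustrate the general idea and are not the record format of askSam or any other package.

```python
from collections import Counter

# Hypothetical records: factual fields situate each piece of ethnographic text.
records = [
    {"collector": "A", "date": "1997-03-02", "site": "Eastside", "kind": "fieldnote",
     "text": "Corner store closed early again."},
    {"collector": "A", "date": "1997-03-05", "site": "Eastside", "kind": "interview",
     "text": "Owner describes changes on the block."},
    {"collector": "B", "date": "1997-03-05", "site": "Westside", "kind": "fieldnote",
     "text": "Tenant meeting in the church hall."},
    {"collector": "B", "date": "1997-03-09", "site": "Eastside", "kind": "fieldnote",
     "text": "Youth gather outside the laundromat."},
]

def select(records, **criteria):
    """Retrieve records whose fields match every given value."""
    return [r for r in records if all(r.get(k) == v for k, v in criteria.items())]

def tabulate(records, field):
    """Count records by the value of one field."""
    return Counter(r[field] for r in records)

eastside = select(records, site="Eastside")
by_kind = tabulate(eastside, "kind")
by_collector = tabulate(records, "collector")

print(len(eastside), dict(by_kind), dict(by_collector))
```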

Note cards, memos, and database fields are easily grasped metaphors for or- 
ganizing data; they have been used by generations of ethnographers. Other 
CAQDA software draws on metaphors without long pedigrees in the ethno- 
graphic community. Some of these packages, such as MAX and Tabletop, 
move the qualitative researcher closer to a quantitative research style. MAX 
allows ethnographers who have also collected quantitative data to integrate 
both text and numbers into a single analytical space. Tabletop displays rela- 
tionships between previously identified features of the ethnographic database 
in graphs such as Venn diagrams and scatter plots. Finally, packages such as 
Folio Views provide a menu of organizational tools that includes outline lev- 
els, database fields, and “pop-up” notes. It is up to the analyst to determine 
which tools facilitate appropriate organization of the ethnographic database 
and how they should be applied. 

4 Most CAQDA packages have capabilities that defy easy categorization according to the kind 
of analysis they perform. Among document processors, Sonar Professional, ZyINDEX, GOFER, 
and FYI3000PLUS also include significant data-organizing features. Similarly, askSam, Folio 
Views, MAX, and Kwalitan are able to search for and retrieve text from the ethnographic database. 

CAQDA packages that accommodate organizing and annotating the eth- 
nographic database are useful in a variety of situations, but they are particu- 
larly useful in research projects as they expand in size and scope. Multisite or 
multiyear ethnographic projects generate a plethora of notes that beg for effi- 
cient organization. Flexible annotations are particularly valuable in multi- 
researcher projects in that each researcher provides her or his own analysis and 
commentary. 


CODING, RETRIEVING, AND COUNTING Coding and retrieving is one of the central 
tasks of CAQDA software packages. Many of the software packages discussed 
above can code textual data, retrieve text based on applied codes, and tabulate 
which codes have been applied to which text. Most packages discussed in this 
section and below use coding and retrieving as their primary method of analysis 
or as a preface to other kinds of analysis. There are many ways to apply codes to 
text. Software such as Kwalitan, QUALPRO, or The Ethnograph number each 
line in the ethnographic database and apply codes to specific lines. Some pack- 
ages encourage coding on the computer screen, whereas others encourage the 
analyst to code a numbered print-out of the text for later entry. Multiple codes 
can be applied to one line or chunk of text, but some packages place limits on the 
number of codes that can be applied (Coffey & Atkinson 1996). No CAQDA 
package eases the intellectual labor involved in coding, but code-and-retrieve 
software eases the administrative labor of applying and altering a coding 
scheme. This is especially so for software packages that take advantage of the 
graphical user interface of the Macintosh or Windows operating systems; in 
these, the analyst uses a mouse to highlight a text passage on the computer 
screen and then selects the code that applies to that section. 

Once codes are applied to the ethnographic database, CAQDA software 
greatly accelerates analysis based on retrieving codes. Code-and-retrievers 
find codes using the same powerful features that document processors applied 
to the raw database. Multiple codes may be searched for at once. Hierarchies of 
codes can be established so that searches for higher-order terms also retrieve 
instances of lower-order terms. Complex searches can be formulated using 
Boolean, sequential, and proximity logic. Retrieval may yield a display of text 
associated with a code or a union of codes, or it may yield counts where those 
codes were applied. A number of CAQDA packages support cross-tabular
displays of counts.
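
The retrieval features described above can be illustrated with a minimal sketch. It does not reproduce the interface of any package named in this review; the segment data, the slash-delimited code hierarchy, and the function names `matches`, `retrieve`, and `count_codes` are assumptions made for illustration only.

```python
from collections import Counter

# Hypothetical coded segments: (document, line range, text, codes applied).
# A slash marks a lower-order code beneath a higher-order one (an assumption).
segments = [
    ("interview_01", (1, 3), "We moved here for my husband's job.",
     {"migration", "family/spouse"}),
    ("interview_01", (4, 6), "My mother watches the kids.",
     {"family/kin", "childcare"}),
    ("fieldnote_07", (10, 12), "Neighbors traded shifts at the plant.",
     {"work", "reciprocity"}),
]

def matches(code, query):
    """True if the code equals the query or sits below it in the hierarchy."""
    return code == query or code.startswith(query + "/")

def retrieve(segments, all_of=(), any_of=(), none_of=()):
    """Return segments that satisfy a simple Boolean query over codes."""
    hits = []
    for seg in segments:
        codes = seg[3]

        def has(query):
            return any(matches(c, query) for c in codes)

        if (all(has(q) for q in all_of)
                and (not any_of or any(has(q) for q in any_of))
                and not any(has(q) for q in none_of)):
            hits.append(seg)
    return hits

def count_codes(segments):
    """Tabulate how often each code was applied."""
    return Counter(code for seg in segments for code in seg[3])

family_hits = retrieve(segments, all_of=["family"])
family_not_childcare = retrieve(segments, all_of=["family"], none_of=["childcare"])
code_counts = count_codes(segments)
```

Searching for the higher-order term `family` retrieves segments coded `family/spouse` and `family/kin`, which is the hierarchical behavior the paragraph describes. Sequential and proximity searches are omitted from the sketch.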

Organizing with CAQDA alters the ethnographic database in two ways. 
First, the database can be organized using database fields, hierarchical levels, 
or annotations so that the analyst has an easier time placing data in context and 
moving about in large ethnographic databases. Second, the database can be
organized by applying codes to the text of the database so that the analyst can
retrieve information from the database based on a theoretical mark-up of the text.
CAQDA software facilitates the administration of both of these activities, but 
it does little to guide the intellectual work involved. 


Symbolic Manipulation 


A fine line separates CAQDA packages that organize data from those that ma- 
nipulate symbols. Symbolic manipulation software helps the analyst develop 
or test theories about relationships in the ethnographic database. Like data or- 
ganizers, symbol manipulators are a popular form of CAQDA, and software 
packages such as NUD.IST, AQUAD, ATLAS/ti, Inspiration, MECA, Meta- 
Design, SemNet, HyperRESEARCH, and QCA are widely discussed in pub- 
lished literature (Hesse-Biber et al 1991, Huber & Garcia 1991, Muhr 1991, 
Ragin 1987, Richards 1995, Richards & Richards 1991b). 

There are three kinds of CAQDA software for symbol manipulation. Some 
symbol manipulators begin where code-and-retrievers leave off. These pack- 
ages focus analysts’ attention on the coding process, encouraging them to cre- 
ate positive links between codes and to develop theory as they create a coding 
scheme. A second form of symbol manipulation is done by theory-building 
software. These packages take material that has been abstracted from the eth- 
nographic database through coding or other means and analyze relationships 
between codes or concepts. The final kind of CAQDA software that facilitates 
symbolic manipulation is hypothesis testers. These packages facilitate the ad- 
vancement and testing of causal statements about relationships between codes 
or concepts in multiple cases in the ethnographic database. 


VALUE-ADDED CODERS The coding process already contains the seeds of 
symbol manipulation. Value-added coders add additional coding and analysis 
features to allow the analyst to move closer to the manipulation of con- 
cepts—usually by moving further from the ethnographic text. Software pack- 
ages such as AQUAD allow the analyst to search purposefully through the eth- 
nographic database for combinations of codes. The analyst can look for theo- 
retically significant combinations of codes, tabulate the number of instances, 
and compare them to counts for combinations of codes that represent compet- 
ing theories. Value-added coders consider the ethnographic database on a 
case-by-case basis so the counts and cross-tabulations they produce are a
case-based numerical summary in contrast to the variable-based summaries
provided by quantitative analysis.

A second way of transforming coding into symbol manipulation is to involve
the computer in the construction of the coding scheme. NUD.IST and
other packages force analysts to develop hierarchical relationships between 
codes as they apply them to the ethnographic database (Richards & Richards 
1995). The construction of hierarchical categories theoretically concretizes the 
codes used and makes the logic of the coding scheme explicit as it is developed
and applied. Hierarchical coding schemes are particularly useful for grounded- 
theory analysis, where new codes and elaboration of existing codes occur con- 
tinuously as the analyst works with the ethnographic database. 

Other value-added coders involve the computer in the coding process with- 
out imposing hierarchical constraints on the coding scheme. In ATLAS/ti, for 
example, the coding scheme is not constrained by the software but is retained 
to manipulate and analyze on its own. Text, codes, and memos can be linked in 
the program and these links later inspected and manipulated in conjunction 
with the original ethnographic text. Maps of relationships between elements in 
the database provide an analytical metaphor distinct from quantitative sum- 
mary statistics or cross-tabulations. 


THEORY BUILDERS Compared to value-added coders, theory-building 
CAQDA software moves the analyst a step further from the ethnographic text. 
Software packages such as ETHNO, Inspiration, MECA, and MetaDesign are 
designed to facilitate the conceptual manipulation of ethnographic data. 
Theory-building CAQDA software packages do not actually construct theory, 
of course. They construct a graphical map (nodes and links) of ethnographic
data. Nodes represent data (fieldnotes, memos, codes, etc), and links represent 
relationships between data. Maps may help the analyst picture the project's 
theoretical shape, the concepts in use, the relationship between those concepts, 
and the ethnographic data that have been collected regarding each of those 
concepts and links. Theory-building software facilitates experiments with dif- 
ferent concepts and links within the research project. 
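
The node-and-link structure behind such maps can be sketched in a few lines. This is an illustrative data structure only, not the design of any theory-building package discussed here; the class `ConceptMap`, its methods, and the sample nodes are assumptions.

```python
from collections import defaultdict

class ConceptMap:
    """A small node-and-link map; node kinds and relation labels are free text."""

    def __init__(self):
        self.kinds = {}                  # node name -> kind (concept, memo, ...)
        self.links = defaultdict(list)   # node name -> [(relation, other node)]

    def add_node(self, name, kind):
        self.kinds[name] = kind

    def link(self, a, relation, b):
        """Record a labeled link; stored on both ends, so direction is not kept."""
        for name in (a, b):
            if name not in self.kinds:
                raise KeyError(f"unknown node: {name}")
        self.links[a].append((relation, b))
        self.links[b].append((relation, a))

    def neighbors(self, name, kind=None):
        """Nodes linked to `name`, optionally restricted to one kind."""
        return sorted(other for _, other in self.links[name]
                      if kind is None or self.kinds[other] == kind)

cmap = ConceptMap()
cmap.add_node("reciprocity", "concept")
cmap.add_node("trust", "concept")
cmap.add_node("fieldnote_07", "fieldnote")
cmap.add_node("memo_03", "memo")
cmap.link("reciprocity", "is supported by", "fieldnote_07")
cmap.link("reciprocity", "is discussed in", "memo_03")
cmap.link("trust", "depends on", "reciprocity")

evidence = cmap.neighbors("reciprocity", kind="fieldnote")
related = cmap.neighbors("reciprocity", kind="concept")
```

Asking for the fieldnotes attached to a concept is one way a map can keep the analyst in touch with the data that support each concept and link.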

But theory-building CAQDA packages need not be reserved for the arm- 
chair ethnographer idly speculating on abstract relationships in field data. 
Theory builders can also incorporate links to the original ethnographic text 
that encourage grounding in the original data and checks on concept validity. 
In addition, theory builders need not be reserved for analyses of a nearly fin- 
ished research project (nor need they be the exclusive province of the principal 
investigator). Theory builders can aid ethnographers who are mapping com- 
plex empirical concepts or events during the course of fieldwork. 


HYPOTHESIS TESTING Some value-added coders such as HyperRESEARCH 
and AQUAD as well as stand-alone packages such as QCA use hypothesis 
testing, the third form of symbol manipulation. Hypothesis-testing software
bridges the gap between qualitative and quantitative analysis by facilitating 
case-based analysis of qualitative data. These packages allow the analyst to 
specify hypotheses based on codes applied to text (in HyperRESEARCH and 
AQUAD) or based on a descriptive matrix of cases (in QCA). Hypothesis 
testers determine how causally antecedent features of cases are related to 
outcomes. Boolean algebra is used to define the antecedent conditions for 
each case in the database. CAQDA software reduces large numbers of cases 
into statements that identify under what conditions the outcome of interest pre- 
vails. 

Qualitative hypothesis testing determines what qualities of cases are crucial 
for a specified outcome. In contrast, quantitative hypothesis testing focuses on 
the contribution of different variables to the outcome. Aside from this differ- 
ence, CAQDA packages that include hypothesis-testing features are similar to 
statistics software that dominates computer-assisted analysis of quantitative 
data. Hypothesis testers encourage the analyst to develop ideas in the form of 
equations (Boolean rather than arithmetic) and to investigate how different 
terms (binary codes rather than multivalue variables) in the equation affect its 
ability to accurately explain outcomes. 
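
As a rough illustration of a Boolean hypothesis stated over binary codes, the sketch below scores two candidate equations against a small set of invented cases. The cases, the code names, and the function `fit` are assumptions, and the agreement score is a simple stand-in: it is not the Boolean minimization that QCA performs, nor the procedure of HyperRESEARCH or AQUAD.

```python
# Hypothetical cases described by binary codes (1 = present, 0 = absent).
cases = [
    {"union": 1, "layoffs": 1, "allies": 0, "strike": 1},
    {"union": 1, "layoffs": 0, "allies": 1, "strike": 1},
    {"union": 0, "layoffs": 1, "allies": 1, "strike": 0},
    {"union": 1, "layoffs": 0, "allies": 0, "strike": 0},
    {"union": 0, "layoffs": 0, "allies": 0, "strike": 0},
]

def fit(hypothesis, cases, outcome):
    """Share of cases in which the Boolean hypothesis agrees with the outcome."""
    agree = sum(bool(hypothesis(c)) == bool(c[outcome]) for c in cases)
    return agree / len(cases)

# H1: strike = union
def h1(c):
    return c["union"]

# H2: strike = union AND (layoffs OR allies)
def h2(c):
    return c["union"] and (c["layoffs"] or c["allies"])

fit_h1 = fit(h1, cases, "strike")
fit_h2 = fit(h2, cases, "strike")
print("H1 agrees with", fit_h1, "of cases; H2 agrees with", fit_h2)
```

Adding or dropping a term in the equation and rerunning `fit` shows how each binary code changes the share of cases the hypothesis accounts for.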

Stand-alone hypothesis testers remove the analyst from the original eth- 
nographic database. These software packages are useful in the analysis of 
data from a variety of sources and not only from ethnographic field studies. 
Hypothesis testers that include search-and-retrievers or data organizers 
may encourage the analyst to remain in contact with the ethnographic data- 
base even as analysis proceeds along more abstract and quasi-quantitative 
avenues. Ideally, hypothesis-testing software allows the analyst to ensure re- 
liability through hypothesis checking and to maintain validity by returning 
frequently to re-examine the original ethnographic database and the codes, 
memos, and annotations that have accumulated over the course of the re- 
search project. 

Symbol manipulation includes a variety of techniques for analyzing ethno- 
graphic data in ways that take advantage of microcomputers. Value-added 
coders encourage the analyst to develop explicit links between codes and data 
as the analysis proceeds. The software keeps track of the relationships between 
codes as they develop and then makes them available for later re-inspection 
and analysis. Theory builders facilitate exploration of concepts in ethno- 
graphic research projects through graphical displays and the ability to quickly 
move between different levels of detail. Finally, hypothesis testers move 
CAQDA closer to the practices of quantitative research by embracing the goals 
of reliability and explanation. Hypothesis-testing packages may even allow 
analysts to strive for reliability and causal explanation without losing the tradi- 
tional advantages of qualitative data with respect to validity. 


FUTURE DIRECTIONS AND RELEVANCE OF CAQDA


The current state of computer-assisted data analysis among qualitative re- 
searchers resembles the proverbial water glass that may be either half full or 
half empty. The large number of CAQDA software programs available sug- 
gests that we are in a preliminary stage of computer entry into the qualitative 
field. With time, the computer will do for qualitative data analysis what it has 
done in the quantitative realm: reduce labor, regularize procedures for data 
gathering and analysis, and establish conventions for the reporting of results. 
Moreover, the diversity of program options will allow these advances to occur 
along parallel methodological lines so that regularizing data-handling proce- 
dures will not require homogeneous epistemological stances. On the other 
hand, the still infrequent mention of CAQDA in ethnographic writing means 
that the expansion of software choices has not yet influenced the course of eth- 
nographic research. CAQDA may be a significant advance for positivist eth- 
nographers, but its potential for regularizing analysis in the qualitative field 
has not been reached. Of course, inertia among researchers and peer reviewers 
may account for some of the gap between expanding software choices and the 
dearth of CAQDA mention in published research (Lee & Fielding 1991:9). But 
there are fundamental issues about qualitative data analysis that inform the 
half-full and half-empty perspectives on CAQDA. 


Lack of the "Killer App"


CAQDA software has proliferated in the last decade and a half, but no “killer 
app” has emerged from among the ranks of CAQDA software (Blank 1991). A 
killer app is a computer application that makes the use of the computer irre- 
sistibly compelling by doing tasks unmanageable without computer assis- 
tance, in the fashion that spreadsheet programs VisiCalc and Lotus 1-2-3 moti- 
vated United States businesses to place personal computers on employees’ 
desks. Most CAQDA software diminishes the amount of labor needed to or- 
ganize and code ethnographic data but does not fundamentally change the pro- 
cess of ethnographic analysis. In fact, ethnographers considering computer use 
must scale several learning curves (which programs are available, what are the 
basics of seemingly appropriate ones, what is the actual operation of the se- 
lected one) and then shape their data and analysis to the requirements of the 
chosen software package. Lacking an irresistibly compelling reason to adopt 
CAQDA, ethnographers may forgo computer assistance simply because the 
costs outweigh the benefits. 

The computer offers three ways of facilitating qualitative analysis that may 
lead to, but are no guarantee of, the enthronement of a CAQDA killer app. 
First, CAQDA packages reduce the administrative burdens of ethnographic 
analysis. Administrative assistance is a strong reason to climb learning curves 
in some research projects, such as those that use grounded-theory methods or
those large projects that involve multiple sites or multiple researchers. But 
given the diversity of techniques for ethnographic analysis, administrative re- 
duction is compellingly attractive to only a fraction of qualitative researchers. 
Second, many CAQDA programs allow the user to analyze ethnographic ma- 
terials that are difficult to access without the computer. These packages inte- 
grate text, graphics, sound, and video; they encourage analysis based on the 
creation of links between distinct pieces of the ethnographic database; and 
they open up new possibilities for the presentation of ethnographic research. 
However, not only do many ethnographers work exclusively with text, but also 
text and graphics are the dominant form of the ethnographic report. Multi- 
media capability alone does not create a killer app. Third, some of the features 
of symbol-manipulation software are not easily replicated without a computer. 
Potentially, these packages contain the seeds of a killer app. 

Any CAQDA software that aspires to the title of killer app must accomplish 
two tasks. Like symbol-manipulation software, it must offer analysts the ability 
to perform analyses that are unmanageable without a computer. To be compel- 
ling, the CAQDA software package will have to constitute its own best market- 
ing device. In addition, the methodological and epistemological diversity of eth- 
nographic data analysis means that CAQDA software will have to offer differ- 
ent analytical facilities to different analysts. The challenge of the CAQDA kil- 
ler app is to facilitate the analytical strategies of positivists and non-positivists 
with diverse analytical goals without disproportionately imposing barriers to 
entry on any one group. At present, popular software packages meet the chal- 
lenges of one group or another, but no killer app appears to be on the horizon. 


The Crisis of Representation and CAQDA 


Part of the reason that no CAQDA package is poised to become a killer app is 
that contemporary methodological discussions in ethnography are not related 
to the integration of computers into qualitative data analysis—a fact our sur- 
vey of recently published ethnographic work makes clear (see footnote 1). 
References, implicit and explicit, to the double crisis of representation and le- 
gitimation, what Denzin & Lincoln have termed the fifth “moment” of qualita- 
tive research, appear frequently in published ethnographies, and the crisis is of 
great concern to methodologists (Denzin & Lincoln 1994b, 1995; Snow & 
Morrill 1995a, but see also Snow & Morrill 1995b).° 

In this climate of ferment, the rules for analysis are open to question, and
one of the CAQDA paradigms may emerge to organize future qualitative
work. Users and developers of hypertext software are particularly excited and
optimistic about this prospect. Hypertext not only makes the case for CAQDA
as a killer app but also addresses the limitations of previous conceptions of
computer use. Hypertext analysis is less rigid, more susceptible to interpretation,
and most importantly, not lineally descended from the numerical processing
paradigm used by quantitative researchers.


>We hope not to open the Pandora’s box of ethnography’s decades-long crisis of representation.
Consult Denzin & Lincoln 1994a for a variety of perspectives on this question.

But the crisis in legitimation is a particularly hostile atmosphere for 
computer-assisted methods that are often associated with a positivistic ap- 
proach to data analysis. Qualitative researchers have already expressed con- 
cerns about the use of CAQDA in practice. One fear is that the computer will 
"take over" qualitative data analysis—turning against the ethnographer like 
Frankenstein's monster. Theoretically, this fear is calmed by the reminder that 
the real work of qualitative data analysis lies not in the mechanics of searching 
for text, applying codes to data, or testing hypotheses using those codes. 
Rather, the work lies in the annotation and rewriting of notes, in the conceptu- 
alization and development of a coding scheme, and in the art of proposing rea- 
sonable hypotheses (Hesse-Biber 1995). Practically, researchers report that 
the use of CAQDA software encourages the exact opposite of the Frankenstein 
scenario. Outside the computer—in piles and files of note cards, transcripts, 
and memos or in boxes of audiotape—the data overwhelm the ethnographer. 
The computer allows the ethnographer to manage the overwhelming amount 
of data. This encourages the ethnographer to approach the data and become 
comfortable "playing" with it and learning it (Smith & Hesse-Biber 1996). In
short, rather than distancing the ethnographer from the data, the use of the 
computer reduces the distance between analyst and data by making the latter 
less overwhelming and more approachable. The computer can facilitate the 
analyst's movement away from the data, but it does not cause this movement. 

There remains, of course, the issue of what the ethnographic enterprise is. 
When it is based solely on “understanding,” as it is for the traditions of eth- 
nomethodology and symbolic interaction, computer assistance cannot make up 
for the shortcomings in the researcher's basic talent to interpret. However, for 
ethnomethodologists interested in analyzing indexical language patterns, for ex- 
ample, CAQDA software has the potential to be quite helpful (see, for instance, 
Schegloff 1996). In the case of anthropology, there remains the question of 
whether it is necessary for the ethnographer to penetrate the psychological world 
of the native or simply to interpret it through what Geertz identifies as a series 
of symbolic forms—words, images, institutions, and behaviors (Geertz 1983: 
58). CAQDA can help the researcher identify social patterns, but it cannot sub- 
stitute for the insight of the researcher. For example, Geertz found in his study of 
Moroccan society that the linguistic concept of “nisba” was important in sepa- 
rating people from each other and determining what it meant to be a person. 
Using suffixes, the people he studied were able to identify who belonged to 
what tribe, city, family, etc. Although Geertz found this pattern without the use 
of CAQDA, computer assistance would have increased the probability that a
less insightful analyst would have seen this concept recur in a variety of contexts 
and grasped its significance. CAQDA can compensate for the limitations of the 
fieldworker by highlighting significant patterns recorded in the notes, even if 
the researcher did not recognize the pattern at the time the notes were recorded. 


Systemization of Ethnographic Methods 


For ethnographers not torn by the twin crises of representation and legitima- 
tion, the advent of CAQDA opens a couple of possibilities. First, the use of 
CAQDA software makes explicit the methods of analysis used in converting 
ethnographic data into ethnographic reports. The explicit discussion of meth- 
ods of analysis in the grounded-theory school midwifed the development of 
much CAQDA software. Computer-assisted analysis goes beyond discussion, 
however, by allowing ethnographers to share details of their analysis process. 
Even when ethical concerns prevent the sharing of raw data, the use of 
CAQDA may increase reliability by making explicit the concrete steps taken 
in moving from data to conclusion. 

Second, the use of computers fosters increased reliability and generalizabil- 
ity by expanding the amount of data that can be managed and exhaustively ana- 
lyzed within a single ethnographic project. Data expand rapidly in ethnogra- 
phies involving multiple sites or multiple researchers. In large-scale sociological 
and anthropological studies, the senior researcher becomes the analytical spe- 
cialist (examples undertaken without CAQDA include Lewis 1963, Moore & 
Garcia 1978, Rainwater 1970, Sullivan 1989, Warner 1963). All members of 
the research team funnel data to the leader, who guides the analysis and writes 
research reports. Computer assistance makes it possible for researchers to col- 
laborate more easily as data management devolves to the database system. 

Combined, explicit systems of analysis and increased ability to generalize 
reliably suggest the development of a new way of organizing data, asking
research questions, and systematically developing answers in ethnographic
research. CAQDA software may allow ethnographers to access large ethnographic
databases directly—without the theoretical intermediary of a single
intellectual vision or research goal. The computer can accommodate data
collected by multiple fieldworkers and facilitate coding, re-coding, linking, and
re-linking by multiple investigators. Within this analytical space, differing un- 
derstandings of the same database can be produced and compared, and ana- 
lysts can examine the procedures undertaken to produce each account. 


CONCLUSION 


To date, all that many ethnographers have had to rely on was their memory of 
the data they collected and the meaning of those data in the context of their 
study. However, the workings of memory create two potential problems for
researchers analyzing ethnographic field data. First, researchers may use those
data that were most dramatic in the fieldwork and erroneously present them as 
being the most significant; second, they may use more data from the later 
stages of fieldwork and less of what happened in the middle or beginning be- 
cause the later data are fresher and clearer in their minds. CAQDA can help the 
careful analyst avoid both of these problems, but it is no panacea. Researchers 
who use CAQDA still face issues related to representation. Data quality is di- 
rectly tied to the ability of the researcher to observe significant phenomena in 
the course of fieldwork and to recognize what he or she has seen. While 
CAQDA can compensate for small failures of detailed observation or sharp in- 
sight, it is no substitute for either. 

The use of CAQDA could stimulate team approaches in ethnographic re- 
search that would generate a wealth of data and make important analytic con- 
tributions (see the examples cited above), but CAQDA does not eliminate the 
validity problems inherent to team ethnography. Because data in ethnographic 
teams are gathered by a number of researchers who in many cases have differ- 
ent degrees of training (as well as different degrees of insight), there is no way 
to assure consistency in what each researcher thinks it important to record. 
Thus, there are validity problems for which CAQDA cannot compensate. Eth- 
nographers spend much of their time engaged in filework rather than fieldwork 
(Plath 1990), but quality analysis that has a high degree of validity and reliabil- 
ity remains dependent on the competence and consistency of fieldworkers. 


ABSTRACT


The aim of this paper is to provide an overview of contemporary sociology in
Japan. The sociological discipline in Japan has made remarkable progress 
since World War II. The rapid economic growth during the 1960s, which was 
prompted by Western influence, had an effect on the direction of the disci- 
pline's development. After providing a brief overview of historical events, I 
consider scholarly publications during the last 30 years to illuminate sub- 
stantive trends in sociological research. As might be predicted, the changing 
substantive interests of Japanese sociologists have been in many ways a re- 
sponse to societal changes and needs. I also examine the trends in theory, 
methods, and empirical research. A content analysis of articles published in 
the major Japanese sociological journals shows the growing emphasis on 
theoretical scholarship and the decreasing emphasis on empirical research in 
recent years. The conclusion presents some considerations concerning the 
future direction of Japanese sociology. 


INTRODUCTION 


The aim of this paper is to provide an overview of contemporary sociology in 
Japan. The development of sociology in any country is closely tied to its his- 
torical events and other circumstances in the society because the objects of so- 
ciological observation are society, social phenomena, and individuals in the 
society. Japanese sociology is no exception. The sociological discipline in
Japan has made remarkable progress since World War II. The rapid economic
growth during the 1960s, which was motivated by Western influence, had an
effect on the direction of the discipline's development. I first provide a brief 
overview of the historical events that helped shape the development of Japa- 
nese sociology. I then consider scholarly publications during the last 30 years 
to illuminate substantive trends in sociological research and the general state 
of the discipline. In this way, I show that the changing substantive interests of 
Japanese sociologists have been in many ways a response to societal changes 
and needs. I also examine the trends in theory, methods, and empirical
research. A content analysis of articles published in the major Japanese
sociological journals is provided to show the growing emphasis on theoretical
scholarship and the decreasing emphasis on empirical research in recent years. 
My discussion concludes with some considerations concerning the future di- 
rection of Japanese sociology. 


HISTORICAL BACKGROUND 


The end of World War II marks the beginning of contemporary Japanese his- 
tory. After the surrender in 1945, the country was economically devastated and 
its people demoralized. During the subsequent six years of American occupa- 
tion, democratic systems, values, and ideologies were infused into Japanese 
politics, economics, education, and other social institutions. A new
constitution was implemented in 1947 that guaranteed freedom of speech, religion, and
assembly, and provided various civil liberties including the right to vote for 
both men and women. Reforms undertaken during the American occupation 
included the dissolution of the large family-based corporations called Zai- 
batsu, the authorization of workers to organize unions, and the implementation 
of agricultural reforms such as land redistribution. 

The first decade after the war's end was a period of reconstruction and re- 
covery from the damage resulting from the war. Following that, into the 1960s, 
Japan made a remarkable economic recovery as its GNP surged to become the 
second highest in the world. With the growing emphasis on industrialization as 
a means to economic growth, urban infrastructural investment and rapid rural- 
to-urban migration changed the nature of many Japanese cities. 

By the mid-1960s, Japan had completed its economic rehabilitation; how- 
ever, it also began to face new types of social problems such as environmental 
pollution, the undesired byproduct of rapid industrialization. Once basic eco- 
nomic needs were satisfied, the Japanese began to pay more attention to social 
issues, including the environment, social inequality, and civil rights. The 
movements by students and other civil groups in the late 1960s were partially 
responsible for the government's growing attention to social concerns. 

The public's social awareness continued to grow throughout the 1970s de- 
spite the slowdown and stabilization of economic growth. The 1970s marked 
Japan's emergence as a major contender in the world economy. With its
inclusion in the economic summit in 1975 and the opening of its domestic market to
foreign trade, Japan reversed its previous course of economic protectionism. 
Its increased immersion in the global economy stimulated steady economic 
growth that peaked in the late 1980s. Having become a key player in the world 
economy, Japan in the 1990s faces the crucial task of defining its international 
role within the global community. 


Traditional Japanese Sociology 


Prior to World War II, Japanese sociology closely reflected its origins in eth- 
nological research, from which the substantive areas were beginning to 
emerge. Reflecting the influence of ethnology, two major areas of
interest—rural sociology and family sociology—dominated the discipline. Most
sociologists in the pre-World War II epoch were trained in these two areas,
which focused on rural community and area studies and also on traditional 
Japanese family systems. The traditional Japanese family system, called Ie,
not only was a family, as defined in Western society, but also functioned as an
economic unit based on extended kin relations. The internal hierarchical
structure of Ie, the acceptance of an authority structure, the pattern of relationships
between Ie and the community, and the norms developed within the Ie system
were all thought to be the basis for understanding the social structure and so- 
cial organizations in Japan. 

As sociologists expanded their substantive interests beyond the domain of 
the family, they continued to apply this concept to explain other aspects of so- 
cial life (Kiyomi Morioka, personal communication). For example, internal 
structures of various social groups such as business and religious organizations,
and the pattern of relationships—including labor relations—were
related to the traditional Ie concept. While they had undergone some changes,
family sociology and rural sociology remained dominant in sociological
research after World War II.1


JAPANESE SOCIOLOGY IN A HISTORICAL CONTEXT: 
ANALYSIS OF SUBSTANTIVE AREAS 


In this section, I depict the development of sociological research in Japan in a
historical context. To capture the substantive trends in the discipline, I examine
the data on publications, which is perhaps the best descriptor of sociologists’
work.2


1Family sociology is currently considered to have two subfields: traditional and contemporary
(Morioka et al 1993, Mitsuyoshi et al 1986, Mochizuki et al 1987). Traditional family sociology
emphasizes a traditional family system typically seen in rural areas, while contemporary family
sociology, which was developed after World War II, focuses on issues relating to families in more
urbanized areas.

Every year, the Japan Sociological Society (JSS) publishes a list of books 
and articles, including reports and working papers, with sociological content. 
The list is compiled by soliciting information from over 2500 association 
members and by library search. Because of the difficulty of obtaining such in- 
formation, these lists are by no means exhaustive; however, they are conceiva- 
bly the best source of data currently available for my purpose. 

Based on the primary substantive emphasis, each article is categorized into 
one of the substantive areas recognized and defined by the association. Table 1 
displays the number (and the percentage in parentheses) of publications in
five-year periods for each substantive area. The area categories are arranged in
decreasing order based on the number of publications during the most recent
five-year period, 1990-1994. 

The total number of articles in books, journals, and other outlets has in- 
creased over time, from 3297 in the early 1970s to 7882 in the early 1990s. The 
rate of increase was slightly higher in the 1970s through early 1980s than 
thereafter, indicating the discipline’s growth in the 1970s.3 

As the number of publications increased, so did their diversity. New cate- 
gories were created to classify those articles as their numbers reached a criti- 
cal mass. In the early 1970s, four categories were added (planning, sociology 
of law, social history, and leisure/sports). Economic sociology established its 
own category in the mid-1970s. The categories of discrimination, gender/gen- 
eration, knowledge/science, and lifestyle/life structure were added in the early 
1980s, while environmental sociology, the newest category, was established in 
1993. Tracing the changes in the categories themselves is an indication of how 
the discipline has evolved to encompass a stronger emphasis on social issues in 
an industrialized society. The following section reveals that the changing rate 
of publications in the various substantive areas corresponds to the transforma- 
tions in Japanese society since 1965. 


2Other informative data about the growth in sociologists’ activities over time is academic 
institutional data such as the number of departments, faculty, and degrees granted. Currently 122 
Japanese university departments offer undergraduate curricula in sociology. Seventy-six 
universities provide graduate training, of which 47 offer Ph.D. programs (Asahi Shimbun 1996). 
However, accurate information on the number of sociology faculty and students is difficult to 
obtain. This is because sociology departments in Japanese universities are normally included under 
larger institutional units such as humanities, social sciences, and behavioral sciences, and data on 
faculty and students is publicly reported only for those larger units, not at departmental levels. 

3It can be speculated that the increase in the total number of publications may be partly the 
result of improved data collection implemented over time, such as the electronic library search. 
Close examination of the data, however, found no single year that showed a dramatic increase that 
would clearly indicate such a possible artifact. 


Japan's Post-World War II Industrialization and Sociology 


Japanese sociology entered a new phase after World War II. Sociologists’ in- 
terests reflected the nationwide commitment to industrialization and economic 
rehabilitation during the first two decades after the war. Thus, publications in 
industrial sociology, management, and labor relations proliferated during the 
late 1960s, becoming the third most popular substantive focus during that peri- 
od. The declining popularity of this research focus during the subsequent dec- 
ades (by the early 1990s, industrial and labor relations publications ranked be- 
hind eight other substantive foci) suggests that the flurry of interest in the 
1960s reflected and perhaps stimulated the intense pursuit of industrialization 
in the immediate postwar period. 

One of the most striking observations found in Table 1 is the changing pro- 
portions of publications in rural sociology and urban sociology over 30 years. 
Rural/community sociology ranked first (9.2%) in 1965-1969 but eighth 
(5.2%) in 1990-1994, while urban sociology increased its relative ranking 
from tenth (4.8%) in 1965-1969 to second (7.1%) in 1990-1994. The tradi- 
tional dominance of rural sociology eroded as sociologists directed more atten- 
tion to urban areas and the problems associated with urban life. Urban sociol- 
ogy established its dominance, at least in the number of publications, by the 
late 1980s. Indeed the preeminence of urban sociology since the 1980s is re- 
flected in the emergence of one of its offshoots, lifestyle/life structure, as a dis- 
tinctive substantive focus. This is an interdisciplinary substantive field dealing 
primarily with urban lifestyles and the structures around people in cities, in- 
cluding social networks. It is an area unique to Japanese sociology because of 
its focus on the traditional Japanese family system and community structure as 
they relate to contemporary social phenomena dealing with personal and fam- 
ily networks and community activities. 


Social Awareness 


The late 1960s and early 1970s was an era in which Japanese society began 
paying attention to the social costs of economic growth. Sociologists began fo- 
cusing on various social problems, including environmental pollution. The 
number of articles in social problems/deviance soared from 60 (1965-1969) to 
199 (1970-1974), and then to 284 (1975-1979). Concomitant with the stu- 
dents’ movement of the late 1960s, the number of publications on social move- 
ments/collective behavior increased in the 1970s relative to the previous five- 
year period. Because environmental sociology was not recognized as a sepa- 
rate subfield until 1993, previously published articles dealing with environ- 
mental issues were categorized under either social problems or medical sociol- 
ogy, depending on their emphasis. 
Gender studies is an area that has been strongly influenced by American so- 
ciology. The women's movement of the 1960s in America captured Japanese 
public attention a decade later. A certain irony surrounded the fact that the 
country that had introduced Japan to the ideals of democracy, egalitarianism, 
and universal civil rights was now encountering political upheaval owing to a 
legacy of gender discrimination (Ishida 1996). Gender studies caught the at- 
tention of Japanese sociologists, who sought to alleviate similar contradictions 
in their own society. In the late 1970s several societies for women's studies 
were established and gender was recognized as a substantive area separate 
from social problems. Gender has yet to be formulated as a distinctive substan- 
tive category; it is currently combined with generation, which includes geron- 
tology and issues surrounding generational differences. Nevertheless, the 
number of publications on gender issues has grown rapidly within the last ten 
years (from 40 in 1980-1984 to 233 in 1990-1994), and it has become one of the 
major fields in today's Japanese sociology. 


International Relations 


Although Japan's economic development was motivated by Western influ- 
ence, it was not until the late 1960s that Japan began considering issues sur- 
rounding its relationships with the developed countries of the West. As Japan's 
economy caught up with Western standards and its products gained a growing 
share of global markets, Japanese sociologists focused more attention on inter- 
national relations. Politicians, economists, and social scientists began their ef- 
forts to understand other countries and Japan's role in international relations. 
The surge of political/international relations publications in the 1970s reflects 
Japanese sociology's growing interest in global affairs. 

Even though Japan assumed an increasingly central role in the global econ- 
omy during the 1980s and 1990s, there was no proportionate increase in inter- 
national relations publications in the sociological literature. It is possible that 
some internationally focused sociologists may have turned from international 
relations to area studies as an avenue for their global concerns. Compared to 
the 1970s, the number of articles in comparative/area studies increased in the 
late 1980s and early 1990s (from 1.8% in 1975-1979 to 3.8% in 1990-1994). 
Prior to the late 1980s, most area studies focused on the Western, developed 
countries that provided economic models for Japan's development. Since then, 
area studies reflect the growing interest of Japanese sociologists in the devel- 
oping countries of East Asia with which the country has established increasing 
economic ties. Thus the shift in area studies reflects the changing political and 
economic alliances between Japan and other nations. 

Another recent social issue relating to East Asian countries concerns the ris- 
ing number of East Asian immigrants to Japan. Their growing presence is forc- 
ing the country, which has long been highly racially and ethnically homogene- 
ous, to confront issues of ethnic discrimination, changing labor relations, and 
other effects of immigration. These demographic changes have stimulated a 
surge in sociological scholarship examining ethnicity, ethnic discrimination, 
and ethnic-based social movements. 


Information and Technology 


Consistent with its growing importance in other postindustrial societies, infor- 
mation became a valuable commodity as technological development acceler- 
ated in the 1980s in Japan. Mass media have begun to play an important role in 
the society. No longer merely a medium for communicating social events to 
the public, mass media have begun to shape people's perceptions of social re- 
ality. The sociological publication trend during the 1990s seems to reflect 
some of the characteristics common to information-based societies. Articles 
within the areas of culture/religion/ethics and communications/informa- 
tion/symbols have become more frequent, ranking sixth and seventh respec- 
tively in 1990-1994. The modest but steady increases in knowledge/sci- 
ence and leisure/sports also exemplify this trend. 


Concluding Comments 


As the quantity of publications within various substantive fields indicates, 
trends in sociological literature have been shaped by changes in the larger soci- 
ety. Clearly, publication rates alone are insufficient to fully capture the diverse 
changes occurring within a discipline and its subfields. Yet they do provide a 
rough picture of the trends within the discipline and the larger society it repre- 
sents. They also illuminate both changes and continuities within the discipline. 

While various subfields have waxed or waned in popularity, at least one, the 
sociology of education, has continually yielded numerous publications over 
the years. Its consistent predominance likely reflects the salient role of educa- 
tion, both within the discipline and within society at large, which affects such 
diverse social phenomena as socialization, social stratification, status attain- 
ment, and deviance. Yet while the quantity of sociology of education scholar- 
ship has varied little over time, by no means has the subfield been immune to 
social changes during the period. Under the American occupation after World 
War II, the Japanese educational curriculum was modified to place an increas- 
ing emphasis on liberal, instead of imperialistic, principles. Higher educa- 
tional institutions became coeducational, allowing more opportunities for 
women. As rapid economic growth, urbanization, and technological advance- 
ment within Japanese society reshaped core institutions, the educational sys- 
tem was required to provide curriculum and pedagogical methodologies to 
adequately prepare students for new institutional roles. These trends within the 
educational system and the larger society have given rise to new social prob- 
lems, such as the pressure on children to excel on examinations to compete for 
a limited number of university slots (see Takeuchi 1981), or the bullying of 
some students by their peers (see Morita & Shimizu 1986). Such changes have 
offered fertile avenues for research, stimulating Japanese educational sociolo- 
gists to explore the implications of such trends for both the educational system 
and Japanese society as a whole. 

In sum, while the evolution of the subfields to a large extent reflects social 
trends within relevant institutions, the development of each subfield has also 
depended on other factors, both systemic and accidental. My broad-based ac- 
count of substantive trends is intended to provide an overview of the discipline 
rather than a detailed analysis of each subfield and its formative dynamics. 
Their analysis would provide fertile ground for future research in Japanese so- 
ciology. 

For further information about the development of specific substantive ar- 
eas, refer to e.g. Mitsuyoshi et al 1986, for traditional family sociology; Mo- 
chizuki et al 1987, for contemporary family sociology; Miura et al 1986, for 
lifestyle/life structure; Nakata et al 1986, for rural sociology; Suzuki et al 
1985, for urban sociology; Naoi et al 1986, for social stratification and social 
mobility; Inagami et al 1987, for industrial sociology; Nitagai et al 1986, for 
social movements; Mitani et al 1988, for social problems; Mita et al 1989, for 
culture and social perception; Hougetsu et al 1986, for deviance and criminol- 
ogy; Akimoto et al 1985, for political sociology; Shibano et al 1986, for sociol- 
ogy of education; Shouji et al 1988, for social systems and social change; Mi- 
yake et al 1986, for religion; and Takeuchi et al 1987, for mass communica- 
tions. 


THEORY AND METHODS IN JAPANESE SOCIOLOGY: 
WESTERN INFLUENCE 


Theoretical Paradigms 


One striking observation suggested by Table 1 is the dominance of a substan- 
tive area called social thought/social philosophy/history of sociology. A 
prominent disciplinary subfield since the 1960s, it comprised over 9% of all 
sociological publications in 1990-1994. Most of the articles included in this 
category are writings about theories or philosophies of foreign social thinkers. 
Their primary purpose has been to introduce influential works of classical 
theorists such as Marx, Weber, Durkheim, and Simmel, and of contemporary 
theorists such as Parsons, Habermas, Foucault, Luhmann, and Giddens, to 
Japanese sociologists in light of each author’s own commentary, insights, and 
elaborations. A separate category called general theory includes articles on 
and about theories that are more relevant to explaining the social phenomena 
in contemporary Japanese society. Many of these articles depart from the con- 
temporary theories developed in foreign countries. Also evident are the nu- 
merous translated books written by foreign sociologists; as Table 1 indicates, 
approximately 25% of sociological books over 30 years are direct translations 
of foreign works. This demonstrates a clear tendency for Japanese sociology to 
incorporate theories formulated in Western societies. 

Marxism, which was politically suppressed during World War II, quickly 
became a major theoretical interest among Japanese sociologists after the war 
ended. As theoretical sociologists strove to understand Marxism, they focused 
on the concepts most relevant to sociology. As Japan's industrialization 
peaked in the mid-1960s, sociologists began to embrace Parsonian theory. Fol- 
lowing the trend in American sociology, Parsonian theory attracted the atten- 
tion of a new generation of Japanese social theorists seeking to understand 
their industrialized society and the various social problems that accompanied 
it. This structural functional approach was applied and extended to various ar- 
eas such as social change and modernization (e.g. Tominaga 1965), social psy- 
chology (e.g. Sakuta 1972), small groups (e.g. Aoi 1980), and social systems 
(e.g. Yoshida 1974) (see Shiobara et al 1997). Structural functionalism contin- 
ued to be the main sociological paradigm until the late 1970s. At the same 
time, however, it was not immune to criticisms by Marxists, Weberians, and 
phenomenologists, as it was also in American sociology, which contributed to 
its declining dominance in the 1980s. 

As Japan shifted from an industrialized society to an informational society 
in the 1980s, so did sociologists’ interests. As information, rather than manu- 
factured items, became the country’s most valuable commodity, sociologists 
turned their theoretical interests to culture, mass communications, and the me- 
dia. Accordingly, postmodernism rose to ascendance as the primary theoreti- 
cal paradigm. Foucault, Habermas, and Luhmann are examples of foreign 
theorists whom Japanese theorists emulated (e.g. Hashizume, Uchida, Imada). 
Even today, no other theory has achieved preeminence over postmodernism 
(Koutou 1997), which continues to be the dominant theoretical paradigm in 
Japanese sociology. 

The influential nature of Western theories has challenged Japanese theo- 
rists to apply them, with the necessary modifications, to Japanese society. Yet 
whether Western theories can explain Japanese social phenomena, and 
whether, in general, theories developed in one country are automatically appli- 
cable to a different social context, are questions deserving consideration. Japa- 
nese sociologists generally see the benefit of applying theoretical approaches 
formulated in foreign countries to their own society. Moreover, since Japan is 
currently one of the most economically developed countries in the world, theo- 
ries meant to be generalizable to industrialized capitalist nations are in princi- 
ple applicable to contemporary Japan. Yet every society contains a unique 
component resulting from a long series of historical events and coincidental 
circumstances. For example, according to many sociologists, many aspects of 
contemporary Japanese society, from family to corporate and political organi- 
zations, have been shaped by its traditional system rooted in feudalistic prin- 
ciples. The values formed within such traditional social structures, many 
scholars maintain, continue to govern the Japanese people's perception and 
behaviors. In a rapidly changing society, where traditional systems, structures, 
and values are quickly disappearing, Japanese social theorists are faced with 
the challenging task of constructing new theories that incorporate both the so- 
cial conditions unique to Japan and those common to other developed coun- 
tries. 


Methodology and Empirical Research 


During the 1960s the American sociological emphasis on empirical research 
took root in Japan, as Japanese sociologists increasingly came to recognize the 
advantages of applying a scientific approach to their scholarship. Applying 
quantitative techniques and other empirical methodologies learned from 
American researchers, Japanese sociologists enriched their discipline by using 
the new methodological approaches to probe important research questions. 
Their efforts spawned a proliferation of empirical research papers, which be- 
came the most common types of publications in the early 1970s. A content 
analysis of the articles appearing in the primary Japanese sociology journal 
Shakaigaku Hyouron illustrates the trends in empirical research. While no sin- 
gle journal can adequately characterize an entire discipline, Shakaigaku Hyou- 
ron is considered particularly appropriate because it is the official journal of 
the Japan Sociological Society and is targeted toward a general sociological 
audience.4 

Table 2 categorizes the articles appearing in this journal between 1970 and 
1994 according to their content type: empirical, purely theoretical, methodo- 
logical, or review. We observe that 41.9% of the articles published in the early 
1970s were based on empirical research, while only 31.4% were theoretical 
works. This pattern changed toward the end of the decade, however, in that 
theoretical papers outnumbered empirically based ones in the 1980s. The early 
1990s showed a continuation of that trend, with only 20.8% of published arti- 
cles representing empirical research and 63.5% reflecting purely theoretical 
scholarship. 


4The anonymous referee system was implemented in 1979 for Shakaigaku Hyouron. As with 
any other professional journal, the selection of articles would be expected to reflect a certain degree 
of editorial judgment. To minimize such possible biases, articles that appeared in special issues 
were excluded from the present content analysis. 
Table 2  Number and percentage (in parentheses) of articles in Shakaigaku Hyouron 

              1970-1974  1975-1979  1980-1984  1985-1989  1990-1994      Total 

Theory               27         26         33         38         61        185 
                 (31.4)     (29.2)     (38.8)     (38.8)     (63.5)     (40.7) 
Methods               0          3          3          4          6         16 
                  (0.0)      (3.4)      (3.5)      (4.1)      (6.3)      (3.5) 
Empirical            36         28         21         34         20        139 
                 (41.9)     (31.5)     (24.7)     (34.7)     (20.8)     (30.6) 
Review               23         32         28         22          9        114 
                 (26.7)     (36.0)     (32.9)     (22.4)      (9.4)     (25.1) 
Total                86         89         85         98         96        454 
                (100.0)    (100.0)    (100.0)    (100.0)    (100.0)    (100.0) 

Data source: Shakaigaku Hyouron, 1970-1994 
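The percentages reported in Table 2 follow directly from the article counts: each count is divided by the total for its five-year period. The following is a minimal sketch of that arithmetic in Python. The names `COUNTS` and `column_percentages` are chosen here for illustration and are not part of the original analysis.

```python
# Article counts by content type in Shakaigaku Hyouron (Table 2),
# one value per five-year period from 1970-1974 to 1990-1994.
PERIODS = ["1970-1974", "1975-1979", "1980-1984", "1985-1989", "1990-1994"]
COUNTS = {
    "Theory":    [27, 26, 33, 38, 61],
    "Methods":   [0, 3, 3, 4, 6],
    "Empirical": [36, 28, 21, 34, 20],
    "Review":    [23, 32, 28, 22, 9],
}


def column_percentages(counts):
    """Return each content type's share (in %) of the articles in each period."""
    # Period totals are the column sums across all content types.
    totals = [sum(column) for column in zip(*counts.values())]
    return {
        kind: [round(100 * n / total, 1) for n, total in zip(row, totals)]
        for kind, row in counts.items()
    }


shares = column_percentages(COUNTS)
for kind, row in shares.items():
    print(f"{kind:<10}", "  ".join(f"{p:5.1f}" for p in row))
```

Run as written, the sketch gives an empirical share of 41.9% for 1970-1974 and 20.8% for 1990-1994, and a theory share of 63.5% for 1990-1994, which matches the figures cited in the text.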


Methodology is an area in which Japanese sociology has yet to achieve 
much strength, especially compared to American sociology. Quantitative 
methodological papers began to appear in Shakaigaku Hyouron in the late 
1970s (see Table 2), but from this point until the early 1990s they accounted 
for a mere 3.5%, on average, of all published articles. Furthermore, the appli- 
cation of advanced methodological techniques has not yet become a common 
practice. Thus, among the empirical papers published in Shakaigaku Hyouron 
between 1970 and 1994, the content analysis presented in Table 3 reveals that 
only 17.9% employed multivariate statistical techniques to analyze data. 

These trends do not imply that Japanese sociologists have been uninterested 
in empirical studies or methodological issues. A group of mathematical soci- 
ologists (e.g. Yasuda) founded its own subdiscipline and established a journal 
called Riron-to-Houhou (Theory and Methods) in 1985. Despite its recent be- 
ginnings, this journal has quickly become a prestigious outlet for empirical, 
methodological, and mathematical modeling papers. 

In contrast to increased theoretical emphasis in recent years, the data indi- 
cates a decline in the proportion of papers devoted to quantitative methodo- 
logical concerns. Even in a methodologically oriented journal such as Riron- 
to-Houhou, the proportion of quantitative methodological papers decreased 
from 60.6% in 1985-1989 to 39.3% in 1990-1994 (see Table 4). While the ac- 
tual number has increased slightly (from 20 to 24), and still represents the most 
frequent type of article seen in this journal, a greater increase in the number of 
papers of the quantitative-methods type would have been expected from the in- 
crease in the total number of articles published. 
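To make the expectation concrete, one can ask how many quantitative-methods papers Riron-to-Houhou would have published in 1990-1994 had their 1985-1989 share held constant. The sketch below works this out from the Table 4 counts. The constant-share baseline is an assumption introduced here for illustration, and the variable names are hypothetical.

```python
# Counts taken from Table 4 (Riron-to-Houhou).
quant_1985_89, total_1985_89 = 20, 33
quant_1990_94, total_1990_94 = 24, 61

# Share of quantitative-methods papers in the first period (about 60.6%).
share_early = quant_1985_89 / total_1985_89

# Illustrative baseline: the count expected in 1990-1994 if that share had held.
expected_1990_94 = share_early * total_1990_94
shortfall = expected_1990_94 - quant_1990_94

print(f"expected {expected_1990_94:.1f}, observed {quant_1990_94}, "
      f"shortfall {shortfall:.1f}")
```

Under this baseline, about 37 quantitative-methods papers would have been expected against the 24 actually published, so the drop in share reflects growth in other article types rather than a fall in the absolute count.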

In Shakaigaku Hyouron, the professional journal of the Japan Sociologi- 
cal Society, empirically based papers not only constituted a diminishing pro- 
portion of publications, but they were less often based on quantitative data. In 
the early 1970s, 94.4% of empirical works were based on quantitative data; by 
the early 1990s, the proportion of quantitatively oriented empirical papers had 
declined to 70.0% (see bottom half of Table 3). Empirical publications based 
on secondary data decreased markedly, dropping from 75.0% in 1970-1974 to 
50.0% in 1990-1994, while those using intensive interviews became more 
common, rising from 5.6% to 30.0% over the same 20 years. 


Table 3  Number and percentage (in parentheses) of empirical articles in Shakaigaku Hyouron 

                            1970-1974  1975-1979  1980-1984  1985-1989  1990-1994      Total 

Analytical method 
  Multivariate                      2          6          3          9          4         24 
                                (5.7)     (24.0)     (14.3)     (27.3)     (20.0)     (17.9) 
  Nonmultivariate                  33         19         18         24         16        110 
                               (94.3)     (76.0)     (85.7)     (72.7)     (80.0)     (82.1) 
  Total (5 missing cases)          35         25         21         33         20        134 
                              (100.0)    (100.0)    (100.0)    (100.0)    (100.0)    (100.0) 
Data 
  Survey                            7          7          5          9          4         32 
                               (19.4)     (25.0)     (23.8)     (26.5)     (20.0)     (23.0) 
  Secondary data                   27         17         14         15         10         83 
                               (75.0)     (60.7)     (66.7)     (44.1)     (50.0)     (59.7) 
  Intensive interview               2          4          2         10          6         24 
                                (5.6)     (14.3)      (9.5)     (29.4)     (30.0)     (17.3) 
  Total                            36         28         21         34         20        139 
                              (100.0)    (100.0)    (100.0)    (100.0)    (100.0)    (100.0) 

Data source: Shakaigaku Hyouron, 1970-1994 
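The quantitative shares cited for the early 1970s and early 1990s can be reproduced from the lower panel of Table 3. The sketch below assumes that "quantitative" means survey data plus secondary data, with intensive interviews counted as qualitative. The text does not state that grouping, but it is the one that yields the reported figures. The names used are illustrative only.

```python
# Data-source counts for empirical articles (Table 3, lower panel).
DATA_SOURCES = {
    "1970-1974": {"survey": 7, "secondary": 27, "interview": 2},
    "1990-1994": {"survey": 4, "secondary": 10, "interview": 6},
}


def quantitative_share(counts):
    """Percentage of empirical articles based on survey or secondary data.

    Assumption: survey and secondary data are treated as quantitative,
    intensive interviews as qualitative.
    """
    quantitative = counts["survey"] + counts["secondary"]
    return round(100 * quantitative / sum(counts.values()), 1)


for period, counts in DATA_SOURCES.items():
    print(period, quantitative_share(counts))
```

With these counts the share is 94.4% for 1970-1974 (34 of 36 articles) and 70.0% for 1990-1994 (14 of 20).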

This decreasing interest in the quantitative approach may reflect the change 
in the discipline’s dominant theoretical paradigm. The shift from structural 
functionalism to postmodernism likely stimulated a parallel movement away 
from quantitative analysis. Instead, qualitative methods and ethnomethodol- 
ogy are showing a gradual gain in popularity. 

In short, these trends suggest that Japanese sociology has become more 
theoretically oriented in recent years, yet at the same time less inclined to ver- 
ify theoretical propositions through empirical evidence. Because the link be- 
tween theory and method is made through empirical research, the decline in 
empirically oriented works may contribute to the disarticulation between theo- 
retical and applied scholarship as well as the stagnation of social methodology. 

Although the overall number of empirical publications has decreased in re- 
cent years, there are particular substantive areas in which empirical research 
has been consistently strong over the years. These include family, rural, urban, 
and social stratification and social mobility. Family and rural sociologists have 
traditionally been trained to use empirical research. Having their roots in eth- 
nology, they have relied on field work, intensive interviews, and historical 
documents for their empirical observations. With the introduction of scientific 
methods, they began using surveys as data-collection tools. As the discipline’s 
focus shifted from rural to urban studies, surveys have become a dominant 
methodology applied among urban sociologists. Currently, many large-scale 
surveys are being funded within family and urban sociology. 


Table 4  Number and percentage (in parentheses) of articles in Riron-to-Houhou 

                          1985-1989  1990-1994      Total 

Theory                            6         17         23 
                             (18.2)     (27.9)     (24.5) 
Methods - quantitative           20         24         44 
                             (60.6)     (39.3)     (46.8) 
Methods - qualitative             1          1          2 
                              (3.0)      (1.6)      (2.1) 
Empirical                         4         10         14 
                             (12.1)     (16.4)     (14.9) 
Review                            2          9         11 
                              (6.1)     (14.8)     (11.7) 
Total                            33         61         94 
                              (100)      (100)      (100) 

Social stratification/mobility researchers have played an important role in 
the field of quantitative methods and empirical research. The most prominent 
survey in Japanese sociology is the SSM (Social Stratification and Social Mo- 
bility) survey, which has been carried out every ten years since 1955. It is 
based on a national sample of over 5000 individuals selected through a multi- 
stage stratified sampling method. These surveys contain interviews whose 
items are targeted primarily toward stratification research, including measures 
of family background, occupation, education, and social status (see Tominaga 
1979 for details). Each wave of the survey also includes various questions re- 
flecting the interests of researchers involved in the survey and the broader so- 
ciological community. The results from each wave of the survey have been 
published as monographs (e.g. Tominaga 1979, Naoi & Seiyama 1990, Hara 
1990, Kikuchi 1990, Okamoto & Naoi 1990), which are regarded as the front- 
line of Japanese empirical research. 


DISCUSSION 


To summarize the development of an entire discipline is an ambitious task. In- 
stead of introducing and critiquing major theories and research in selected ar- 
eas, this paper focused on describing the overall substantive emphases within 
the discipline over time. The analyses in this paper are based on the available 
data on sociological publications, which proved to be useful in examining the 
dynamics of Japanese sociologists’ interests and productivity. 

The theoretical and methodological trends seen in Japanese sociology in the 
last 25 years raise a few issues for consideration. Ideally, theoretical direction 
and methodological focus should be related to each other, for the choice of 
methodology depends on the theoretical questions being explored. Yet the 
quantities of methodological and theoretical works should vary independently; 
an increased emphasis on one should not imply a decreased emphasis on the 
other. Moreover, if theory and methods were closely linked, the introduction 
of new theories would likely expand the quantity of empirical research. In the 
case of Japanese sociology, however, the observations of the above content 
analysis do not illustrate such linkages. The data shows an inverse pattern be- 
tween the number of theoretical and empirical papers. It would appear that the 
emerging theories are not yet being substantiated sufficiently by empirical 
data. Japanese sociology would benefit from a closer consolidation between 
theory and research, both by formulating theories into verifiable propositions 
and by using theory to inform empirical research designs. 

At the same time, more extensive methodological training is called for in 
Japanese universities. The lack of emphasis on methodological training is 
partly an unfortunate consequence of the structure of Japanese academic insti- 
tutions. Consistent with the vertical structure in traditional Japanese organiza- 
tions, mentorship is emphasized in graduate training. This results in a defi- 
ciency in structured curriculum covering a wide variety of core sociological 
courses at the graduate level. This type of training tends to produce highly 
qualified specialists in narrow fields. 

The Japan Sociological Society has recently launched its effort to establish 
a system through which survey data is made available to researchers other than 
those who actually conducted the survey. An increase in empirical research 
might be stimulated by such a mechanism. In addition to the SSM survey, nu- 
merous large-scale surveys are funded and conducted each year. If shared and 
used appropriately, these results could provide a basis for empirical research 
that would otherwise require extensive data-collection efforts. 

However, Japanese sociologists’ efforts to stimulate empirical research 
may be tied to a more fundamental issue concerning the most effective way to 
position themselves and their discipline within the global sociological commu- 
nity. Their successful efforts to assimilate the sociological achievements of the 
United States and Europe have enriched their knowledge of theories and meth- 
ods developed in foreign countries. Yet they have had less success in making 
their work visible to the international community of sociologists. The socio- 
logical discipline has reached a stage at which most practitioners appreciate 
the benefits of a global perspective. A more concerted interchange between 
Japanese sociologists and others in the discipline might be achieved through 
an emphasis on comparative research that illuminates both the uniqueness and 
universality of Japanese society. Such an exchange would not only benefit 
Japanese scholars but would be highly advantageous to the entire discipline. 
ABSTRACT 


In this paper I explore the questions of why and how sociologists should be 
interested in narrative. The answer to the first question is straightforward: 
Narrative texts are packed with sociological information, and a great deal of 
our empirical evidence is in narrative form. In an attempt to answer the sec- 
ond question, I look at definitions of narrative, distinguishing narrative from 
non-narrative texts. I highlight the linguistic properties of narrative and illus- 
trate modes of analysis, paying close attention to both the structural proper- 
ties of the text and its subtle linguistic nuances. I guide the reader through a 
detailed analysis of a short narrative text. I show how linguistics and sociol- 
ogy interplay at the level of a text. 


Narrative is present in myth, legend, fable, tale, novella, epic, history, trag- 
edy, drama, comedy, mime, painting (think of Carpaccio’s Saint Ursula), 
stained glass windows, cinema, comics, news items, conversations. Moreover,
under this almost infinite diversity of forms, narrative is present in every
age, in every place, in every society; it begins with the very history of mankind
and there nowhere is nor has been a people without narrative. All
classes, all human groups, have their narratives . . . narrative is international, 
transhistorical, transcultural: It is simply there, like life itself.. 


Barthes 1977:79


NARRATIVE: WHAT'S IN IT FOR US?


Given a set of numbers—236, 435, 218, 767, 456, 3677 —most sociologists cer- 
tainly would recognize the numbers as data. Most would also know what to do 
with them. They could plot those data, tabulate them, estimate statistical mod- 
els. Some would easily deal with the problems of first-order correlation of the 
residuals that those data and those models may produce; they would try to 
overcome problems of heteroscedasticity, non-normality, influence, and all
the other little and big problems that plague our statistical work. But take the 
following text. 


As soon as the water went down, I began to work digging the bodies out of 
the debris. I worked for eight days after the flood looking for bodies, and I re- 
covered twenty-two of them. The last one I found was a little five-year-old 
boy. It reminded me so much of my own little boy that I could not take any
more. That is when I went to pieces. ... (Erikson 1976:165)


As sociologists, what should we get out of this passage? Does the passage 
contain data (data?) of any interest to us? What are we to make of it? How are 
we to analyze it, to test hypotheses, to draw inferences? Paradoxically, we
are more at ease in the artificial and (wo)man-made world of statistics than in 
the more natural world of language and words. After all, a course in statistics 
is part and parcel of any sociology graduate training. A course in the analysis
of text certainly is not, and even courses in qualitative research methods pay 
little attention to texts and narratives as such. And, perhaps, before we even ask 
questions of method and analysis, there is a more fundamental question that 
begs an answer: Why should sociologists be interested in narrative? 

In this article, I take up these questions. Do not expect a comprehensive or 
summary review of the literature on narrative—of necessity, I will be rather selective.1
My goal is to introduce sociologists to the basic concepts, particularly
as elaborated by linguists, and to show how linguistics and sociology interplay 
at the level of a text. I provide definitions of narrative, distinguishing narrative 
from non-narrative texts. I highlight the linguistic properties of narrative and 
illustrate modes of analysis. As a way to review the issues involved I guide the 
reader through a detailed analysis of a short narrative text. In this analysis, I 
pay close attention to both the structural properties of the text and its subtle
linguistic nuances.2 After all, as Chatman (1978:94) writes, “For many narratives
what is crucial is the tenuous complexity of actual analysis rather than the
powerful simplicity of reduction.”


1I found Chatman (1978), Genette (1980), Rimmon-Kenan (1983), Cohan & Shires (1988), and
Toolan (1988) to be the best introductions to the issues. Ricoeur's (1984, 1985, 1988) three-volume
work on time and narrative is not for the faint of heart. It provides a comprehensive review of the
views on time and narrative of linguists, philosophers, historians, and sociologists. It may be hard
for the novice to follow the intricacy of Ricoeur’s account of the various authors’ positions, despite
Ricoeur’s lucid and clear language. I would recommend reading it last.

The narrative analysis of the text helps to bring out not only the properly lin- 
guistic characteristics of the story—a task perhaps better left in the hands of 
those who know how to do this best: linguists—but also a great deal of sociology
hidden behind a handful of lines. It is precisely because (a) narrative texts
are packed with sociological information and (b) much of our empirical evi- 
dence is in narrative form that sociologists should be concerned with narrative. 
[Just think: Even the quantitative sociological method par excellence, the sam- 
ple survey, often hides powerful narratives behind its numbers (Mishler 
1986:72).] 


NARRATIVE AND NARRATIVE ANALYSIS 


Labov defined narrative as "one method of recapitulating past experience by 
matching a verbal sequence of clauses to the sequence of events which (it is in- 
ferred) actually occurred" [Labov (1972:359—60); see also Labov & Waletzky 
(1967:20)]. That definition has survived, more or less intact, through the years 
and through a number of hands that have pulled and pushed it from different 
angles. We find it in Rimmon-Kenan: “... narrative fiction ... [is] a succession of 
events" [Rimmon-Kenan (1983:2—3)]; in Cohan & Shires: “The distinguishing 
feature of narrative is its linear organization of events" [Cohan & Shires 
(1988:52—53)]; and in Toolan: “A minimalist definition of narrative might be: 
‘a perceived sequence of nonrandomly connected events’” [Toolan (1988:7)]. 
It is not surprising that all these definitions are in basic agreement. After all, 
they have common roots in the work of the Russian formalists of the beginning
of the twentieth century, Propp (1968) and Tomashevski (1965) in particular. 
It is the Russian formalists who introduced the distinction between story vs 
plot in narrative (fabula vs sjužet). Building upon Aristotle's idea of plot-structure
or mythos—in the master's own words, “By this term ‘plot-structure’
I mean the organisation of the events” (Halliwell 1987:Ch. 6, p. 37)—Toma- 
shevski (1965:67) wrote, “Plot is distinct from story. Both include the same 
events, but in the plot the events are arranged and connected according to the 
orderly sequence in which they were presented in the work." He continues in a 
note, “In brief, the story is ‘the action itself, ... [the plot is] how the reader 
learns of the action.’” Basically, a story refers to a skeletal description of the 
fundamental events in their natural logical and chronological order (perhaps 


2The latter type of analysis has fallen under the domain of discourse analysis [e.g. the four
volumes edited by van Dijk (van Dijk 1985) or the easier treatment by Fairclough (Fairclough
1995)].
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with an equally skeletal listing of the roles of the characters in the story) (Bal 
1977:4; Toolan 1988:9). 

The French structuralists adopted the basic distinction fabula vs sjužet,
coined their own terms for the dichotomy, histoire vs discours, or story vs
discourse [see Benveniste (1971:206–8); Barthes (1977); Chatman (1978:19);
Toolan (1988:11–12)], and further subdivided the plot/discourse level into text (or,
more generally, discourse) and narrating or narration, i.e. “the act of narrating
taken in itself” (see Genette 1980:27, Toolan 1988:10–11).

Bal also speaks of a “three-level hierarchy, histoire, récit, narration” (Bal 
1977:5—6) and of the "three aspects of narrative," story, text, and narration 
(Genette 1980:25—26; see also Rimmon-Kenan 1983:3-4, Cohan & Shires 
1988:53). As usual, the trouble is that subtle differences exist among authors 
not only in the narrative levels and labels, but also in their definitions that 
make ploughing through the literature an unnecessarily difficult task [see 
Toolan (1988:9–11) on this point]. Nonetheless, we could summarize the distinctions
that linguists have introduced in the study of narrative in the following
way:


Narrative
    Story/fabula/histoire
    Plot
        Text/sjužet/discourse
        Narrating/narration


It is the story—the chrono-logical succession of events—that provides the 
basic building blocks of narrative. Without story there is no narrative. “The 
presence or absence of a story is what distinguishes narrative from non- 
narrative texts" (Rimmon-Kenan 1983:15). “A story may be thought of as a 
journey from one situation to another," wrote Tomashevski (Tomashevski 
1965:70). A story, in other words, implies a change in situations as expressed 
by the unfolding of a specific sequence of events. The chronological sequence 
is a crucial ingredient of any definition of story. Tomashevski, Labov, Prince, 
Bal, Todorov, Rimmon-Kenan, and Cohan & Shires all drum away at that 
same point [Tomashevski (1965:70); Labov (1972); Prince (1973:23); Bal 
(1977:7); Todorov (1977:111); Rimmon-Kenan (1983:19); Cohan & Shires 
(1988:53—54)]. 

Not every sequence of any two temporally ordered events can constitute a 
story (Rimmon-Kenan 1983:19). Two sentences such as “Joan took her plane 
at 5 pm" and "Peter drove to the airport at 8 pm" would constitute a story only 
if later sentences established a logical connection between those two sen- 
tences, such as “They had both been looking forward to spending the weekend 
together." The temporal ordering of events in a story is a necessary but not suf- 
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ficient condition for the emergence of a story. The events in the sequence must 
be bound together by some principles of logical coherence.3 At the level of plot
the events of a story can form complex sequences by combining events in a variety
of ways through enchainment, embedding, and joining [Bremond (1966);
Todorov (1981:52–53); Rimmon-Kenan (1983:23)]. Finally, the events in the
story must disrupt an initial state of equilibrium that sets in motion an inversion
of situation, a change of fortunes—from good to bad, from bad to good, or
no such reversal of polarity, just an ‘after’ different from the ‘before,’ but nei- 
ther necessarily better nor worse. 

“The inversion of an event is one of the essential features of a story," sums 
up Prince (1973:28)—the other essential feature being the temporal ordering 
of events in a story.4 In classical Aristotelian poetics, the turn of fortunes—a
reversal—is the key characteristic of comedy and tragedy. “Reversal,” wrote 
Aristotle, “is a complete swing in the direction of the action” (Halliwell 
1987:Ch. 11, p. 42). While comedy marks an improvement of a situation, trag- 
edy marks a worsening, the “transformation to prosperity or affliction” [Hal- 
liwell (1987:Ch. 18, p. 51); on Aristotelian reversals, see Chatman (1978:85) 
and Ricoeur (1984:43-44)]. Reversals can occur repeatedly in a story along 
the sequence: initial state → disruption → new state → disruption → new state
→ ... → final state (equilibrium). Each new state is both a point of arrival and a
point of departure, sort of a temporary equilibrium between the "before and after,"
the past and the future. In Todorov's words, “The elements [of a story] are
related [not] only by succession; ... they are also related by transformation. 
Here finally we have the two principles of narratives" [Todorov (1990:30)]. 

In a sequence, not all events are equally consequential for change of a situa- 
tion. For Tomashevski, “Motifs [basically, actions and events] which change 
the situation are dynamic motifs; those which do not are static" (Tomashevski 
1965:70). This distinction—between those actions and events that fundamentally
alter a narrative situation and the ones that do not—recurs often in the
field. Barthes, for instance, distinguishes between cardinal functions (or nu- 
clei) and catalyzers (Barthes 1977:93—94). Catalyzers "merely fill in the narra- 
tive space," while cardinal functions alter current states of affairs, either by 
bringing them to a new equilibrium or by disrupting an existing equilibrium. 
“Catalyzers are only consecutive [i.e. chronologically ordered] units, cardinal 
functions are both consecutive and consequential" (Barthes 1977:94). Chat- 


3However, as Chatman argues, readers will typically even attempt to make a story out of
temporally sequenced but logically unrelated clauses by implicitly supplying logical connectives
[Chatman (1978:47, 49)].

4See also Prince (1973:23). On the reversal of situation, see Tomashevski (1925:70–71);
Todorov (1977:111–12, 1981:51); see also Aristotle, who first introduced the concept of reversal in
his Poetics (Halliwell 1987:Ch. 11, p. 42). Bremond (1966) also believed that all sequences are
either sequences of improvement or deterioration (see the discussion by Rimmon-Kenan 1983:27).
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man (1978:32, 53—56) adopted Barthes’ basic distinction with different labels: 
kernel and satellite events (labels also adopted by Rimmon-Kenan 1983:16). 
Again, although intuitively and in principle the separation of the clauses of a 
text into different functions should be straightforward, in practice, in the 
analysis of any specific text, what is kernel to one author may be satellite to an- 
other, and vice versa. Deletion of kernels would fundamentally destroy the nar- 
rative logic (Chatman 1978:53). Kernels open up narrative choices. 

Dynamic motifs, cardinal functions, or kernel events have corresponding 
linguistic markers. For Labov (1972:360—61) “a minimal narrative ... [is] a se- 
quence of two clauses which are temporally ordered. ... The skeleton of a nar- 
rative ... consists of a series of temporally ordered clauses" (called narrative 
clauses). And yet, while there may be no narrative without narrative clauses, 
not all clauses found in narrative are narrative clauses. [Danto (1985:143—81) 
similarly talks about "narrative sentences."] 

Consider Labov's example: 


(a) I know a boy named Harry.
(b) Another boy threw a bottle at him right in the head 
(c) and he got seven stitches. 


In this narrative passage, only clauses b and c are narrative clauses. Clause a is 
not. It is a “free clause," in Labov's terminology; a clause that, having no tem- 
poral component, can be moved freely up and down in the text without altering 
its meaning. Not so with narrative clauses. A rearrangement of narrative 
clauses typically results in a change in meaning (“I punched this boy/and he 
punched me” vs “This boy punched me/and I punched him”) (Labov 1972:360). 
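
Labov's distinction between narrative and free clauses can be made concrete with a small sketch. The Python fragment below is an illustration only, not Labov's own procedure: the class `Clause`, the helpers `skeleton` and `same_story`, and the hand-assigned `narrative` flags are assumptions introduced here. It treats two arrangements of a text as telling the same story when their narrative clauses appear in the same order, wherever the free clauses fall.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Clause:
    label: str
    text: str
    narrative: bool  # True for a temporally ordered (narrative) clause, False for a free clause


# Labov's example; the narrative/free coding follows the discussion in the text.
clauses = [
    Clause("a", "I know a boy named Harry.", False),
    Clause("b", "Another boy threw a bottle at him right in the head", True),
    Clause("c", "and he got seven stitches.", True),
]


def skeleton(clause_list):
    """Return the labels of the narrative clauses in their order of telling."""
    return [c.label for c in clause_list if c.narrative]


def same_story(original, rearranged):
    """Hypothetical test: arrangements are equivalent if their skeletons match."""
    return skeleton(original) == skeleton(rearranged)


# Free clause (a) moved to the end of the text.
moved_free = [clauses[1], clauses[2], clauses[0]]
# Narrative clauses (b) and (c) exchanged.
swapped_narrative = [clauses[0], clauses[2], clauses[1]]

print(skeleton(clauses))                       # ['b', 'c']
print(same_story(clauses, moved_free))         # True
print(same_story(clauses, swapped_narrative))  # False
```

Moving the free clause leaves the skeleton intact, while exchanging the two narrative clauses does not. Deciding which clauses count as narrative remains the analyst's coding decision; the sketch only records it.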
Labov not only draws a distinction between narrative and non-narrative 
clauses, he also posits the presence of six distinct functional parts in a fully 
formed narrative: (a) abstract; (b) orientation; (c) complicating action; (d)
evaluation; (e) result or resolution; (f) coda. Of these six functional parts, “only
... the complicating action is essential if we are to recognize a narrative” (Labov
1972:370).6 The complicating action constitutes “the main body of narrative
clauses,” and that body of clauses “usually comprises a series of events”
(Labov & Waletzky 1967:32). Implicitly,


Labov works on the broad assumption that what is said ... will not be the core
of a story; that, rather, what is done ... will be. The “what is done” then becomes
(or may become) the core narrative text of clauses—actions—while
the “what is said” becomes evaluative commentary on those actions...
(Toolan 1988:157)


5In 1925, Tomashevski had already introduced the distinction between bound motifs and free
motifs, where a motif is basically a unit of narrative. Free motifs are those that may be safely
omitted from a story “without destroying the coherence of the narrative”; bound motifs are those
whose deletion would disturb “the whole causal-chronological course of events” (Tomashevski
1965:68). As Todorov notes, “optional (‘free’) propositions are such only from the point of view
of sequential construction, they are often what is most necessary in the text as a whole” (Todorov
1981:52–53). Indeed, it is likely that it is at the level of free motifs that the telling of the story (the
plot) would differ (Tomashevski 1965:68). See also Barthes’ distinction between cardinal
functions (or nuclei) and catalyzers (Barthes 1977:93–94).

6Labov did not test his structural model of narrative. Peterson & McCabe (1983), however, did
analyze a set of children’s narratives using Labov’s scheme. Van Dijk also adopted a Labovian
scheme in his analysis of narratives of ethnic prejudice (van Dijk 1984). For a more complete list,
see Toolan 1988:176.


But the "doing vs saying" distinction implicitly or explicitly is at the core of 
linguistic theories of narrative structures. In Ricoeur's words, “there is no 
structural analysis of narrative that does not borrow from an explicit or implicit
phenomenology of ‘doing something’” (Ricoeur 1984:56). Bal similarly
points out that "in general, narrative theorists rather tend to analyse the
course of action to which they limit their story" (Bal 1977:89). Genette 
(1980:164, 169) distinguishes between narrative of events and narrative of 
words. 

It is this emphasis on action (on "doing something") that has led to the privileged
position of actions/events over actors in poetics. We first find this subordination
of character to action in Aristotle, who wrote in his Poetics: "Tragedy
is a representation not of people as such but of action and of life, and both hap- 
piness and unhappiness rest on action. ... and while men do have certain quali- 
ties by virtue of their character, it is in their actions that they achieve, or fail to 
achieve, happiness. ... without action you would not have a tragedy, but one 
without character would be feasible" [Halliwell (1987:Ch. 11, pp. 37-38); see 
also Ricoeur (1984:37); Barthes (1977:104); Rimmon-Kenan (1983:34)]. 

We find it also in Propp, the Russian formalist of the beginning of this cen- 
tury who left a seminal work on the structural analysis of narrative. In his study 
of Russian folk tales, Propp identified 31 basic functions (namely, spheres of 
action) that are invariant across different tales (Propp 1968). After spending 
the better part of a book discussing these functions and their roles in narrative 
sequences, Propp dedicated only a handful of pages to a quick discussion of 
characters (1968:79—91). All story characters can be reduced to a simple typol- 
ogy of seven "character roles" based on the unity of actions assigned to them 
by the narrative: the villain, the donor, the helper, the sought-for-person and 
her father, the dispatcher, the hero, the false hero. 

We still find this emphasis in Greimas, who proposed to describe and clas- 
sify narrative characters according to what they do (hence the name ac- 
tants)—once more reproducing the subordination of character to action despite 
Greimas' focus on actants (Greimas 1966). According to Greimas, six basic 
actants can be found in all narratives, working in sets of three interrelated pairs 
(Greimas 1966:172–91), sender/receiver, helper/opponent, subject/object,
typically represented in the following way:


Sender → object → receiver


Helper → subject ← opponent


In Greimas’ model, (a) the sender initiates or enables the event; (b) the receiver 
benefits or registers the effects of the event; (c) the opponent retards or impedes 
the event by opposing the subject or by competing with the subject for the ob- 
ject; (d) the helper advances the event by supporting or assisting the subject [see 
also Toolan (1988:93); Barthes (1977:106—7); Rimmon-Kenan (1983:34—35)]. 

In general, a narrative text will comprise a mixture of both narrative and 
non-narrative clauses. In particular, descriptive and expository propositions 
typically enter into a minimal narrative (Tomashevski 1965:66; Bal 1977:13; 
Rimmon-Kenan 1983:14—15). "Description alone is not enough to constitute a 
narrative; narrative for its part does not exclude description" (Todorov 1990: 
28). Narrative texts are those where the distinctive characteristics of the narra- 
tive genre are prevalent (Bal 1977:13). 

Bal correctly points out that a theory of narrative presupposes a theory of text 
genres; she proposes three types of texts, lyrical, dramatic, and narrative (Bal 
1977:12—13). The narrative text basically tells a story (just like drama, but un- 
like lyrical texts) in a complex way (contrary to the other two types of texts) 
wherein the ratio of narrator's discourse to actor's discourse is maximized 
(contrary to both lyrical and dramatic texts). “In modern theories of literature," 
states Bal, “description occupies a marginal role. The structural analysis of 
narrative relegates it to a secondary function: It is subordinate to the narration 
of action. It can occupy the catalyser function, but never that of nucleus" (Bal 
1977:89). Description is a luxury, a narrative ornament (Bal 1977:89—90). 


FURTHER FORAYS INTO A STRUCTURAL 
ANALYSIS OF NARRATIVE 


“A narrative ... shares with other narratives a common structure which is open 
to analysis, no matter how much patience its formulation requires" [Barthes 
1977:80 (1966)]. We have seen the results of that collective exercise in patience
in search of “the invariant structural units which are represented by a variety
of superficial forms" (Labov & Waletzky 1967:12); of the recurrent characteristics
and the "distinguishable regularities" behind narrative (Greimas
1971:794); behind the “millions of narratives" [Barthes 1977:81 (1966)] to be 
more precise. 

Propp took the first bold step toward a structural analysis of narrative when 
he identified an invariant pattern of 31 functions behind the large variety of 
Russian folktales. Regardless of the particular content of a folktale, regardless 
of how the story is told, all Russian folktales, according to Propp, will exhibit 
(at least some of) those 31 basic functions. Furthermore, the sequence in which
those functions appear is fixed.7 Greimas further aggregated Propp's 31 functions
into a basic set of six functions (Greimas 1966). Labov found a six-part
macrostructure in vernacular narratives of New York Harlem African-Ameri- 
cans: abstract, orientation, complicating action, evaluation, result or resolu- 
tion, and coda (Labov 1972:362—70; see also Labov & Waletzky 1967).8 More 
generally, van Dijk has argued that all texts are characterized by macrostruc- 
tures ("schemata") that provide the "global schematic form" of a dis- 
course—different discourse genres being characterized by different schemata 
(van Dijk 1983:24). Thus, the schema of a newspaper article comprises both a 
summary and a story; the story further comprises a situation and comments; 
the situation comprises episode and background; the episode includes main 
events and consequences; while background includes context (circumstances 
and previous events) and history (van Dijk 1988:51—59). 
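
One way to make van Dijk's news schema explicit is to write it out as a nested structure. The sketch below is an illustrative encoding only, assuming a plain Python dictionary (`news_schema`) and a hypothetical helper (`leaves`) that lists each terminal category together with its path; the category names are those given in the paragraph above.

```python
# Each key is a schema category; its value holds the categories nested under it.
# An empty dictionary marks a terminal category.
news_schema = {
    "summary": {},
    "story": {
        "situation": {
            "episode": {"main events": {}, "consequences": {}},
            "background": {
                "context": {"circumstances": {}, "previous events": {}},
                "history": {},
            },
        },
        "comments": {},
    },
}


def leaves(schema, path=()):
    """Return the terminal categories of the schema, each with its full path."""
    found = []
    for name, children in schema.items():
        here = path + (name,)
        if children:
            found.extend(leaves(children, here))
        else:
            found.append(" > ".join(here))
    return found


for category in leaves(news_schema):
    print(category)
```

A coder could use such a listing as a checklist when assigning the sentences of an article to schema categories; the assignment itself would still be a matter of interpretation.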

Of particular interest is the relationship between these deep, macrolevel 
structures and surface, microlevel structures. For Labov (1972:375), narrative, 
macrolevel structures can be mapped into surface structures through adverbial 
elements (for time and space), a subject-noun phrase, and a verb phrase. Other 
linguists, as well, have argued that narrative stories are characterized by a sim- 
ple surface representation patterned after the canonical form of the language 
(subject/action/object) with some modifiers (Todorov 1969:27-41; 1977: 
218-33; 1981:48—51; Halliday 1970; Chatman 1978:91; Prince 1973:32). 
Cognitive psychologists and computer scientists involved in artificial intelli- 
gence projects of computer understanding of natural languages have similarly 
represented stories in terms of a “story grammar” (e.g. Rumelhart 1975; Man- 
dler 1978). 

Over the last decade, sociologists have proposed various methods of analy- 
sis of narrative texts that are fundamentally based on these and other structural 
linguistic characteristics of narrative. Abell (1987), for instance, has proposed 
a methodology that he terms “comparative narratives” based on a formal repre- 
sentation of narrative structures in terms of actors and actions. In my own 
work, I have developed a story grammar (or semantic grammar) to structure 
the narrative information provided by newspaper articles on protest events 
(Franzosi 1989). A semantic grammar offers several advantages over more tra- 
ditional content analysis schemes for the collection of text data (Franzosi 
1989; 1990a,b). In particular, the re-expression of the grammar in set theoreti- 
cal terms allows researchers to go “from words to numbers," i.e. to analyze 


7For a critique of this aspect of Propp’s morphology of stories, see Bremond (1964). Bremond
argues that stories are not all characterized by a single, invariant macrostructural sequence. Rather,
invariant sequences are characteristic of microstructures (i.e. parts of stories) and those sequentially
invariant microstructures combine in a multiplicity of forever varying ways to form complex
narrative macrostructures.
8For a comparison of Greimas’ and Labov & Waletzky’s schema, see van Dijk (1972:293).
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narrative information statistically (Franzosi 1994). Furthermore, within a set 
theoretical notation, the basic structure of a semantic grammar translates easily 
into the mathematical structure underlying network models (Franzosi 1998). 
Intuitively, a semantic grammar structures narrative information within the ba- 
sic template SAO, or subject, action, object, where both subjects and objects 
are typically social actors (e.g. "police charge demonstrators”). In other words, 
the basic structure of a semantic grammar links social actors around specific 
spheres of action. Network models can be used to analyze text data organized 
not only in SAO structures, but also in sets of relational concepts. Concepts, 
rather than social actors, represent the nodes of these networks (for this ap- 
proach, see Carley 1993). 
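
The SAO template and its network reading can be illustrated with a minimal sketch. Everything below is hypothetical except the triplet "police charge demonstrators," which comes from the text: the other triplets are invented for illustration, and the function names (`action_counts`, `to_network`, `out_degree`) are assumptions, not part of any published coding software. The sketch shows how coded triplets can be counted ("from words to numbers") and re-expressed as a directed network of social actors.

```python
from collections import Counter

# Coded (subject, action, object) triplets.
triplets = [
    ("police", "charge", "demonstrators"),           # example given in the text
    ("demonstrators", "throw stones at", "police"),  # invented for illustration
    ("police", "arrest", "demonstrators"),           # invented for illustration
    ("workers", "join", "demonstrators"),            # invented for illustration
]


def action_counts(coded):
    """Count how often each action occurs across the coded triplets."""
    return Counter(action for _, action, _ in coded)


def to_network(coded):
    """Directed, weighted edges: (subject, object) -> number of actions linking them."""
    return Counter((subject, obj) for subject, _, obj in coded)


def out_degree(network):
    """Number of actions each actor directs at other actors."""
    degree = Counter()
    for (subject, _), weight in network.items():
        degree[subject] += weight
    return degree


network = to_network(triplets)
print(action_counts(triplets)["charge"])     # 1
print(network[("police", "demonstrators")])  # 2
print(out_degree(network)["police"])         # 2
```

Here actors are the nodes and actions the ties. In the relational-concept approach mentioned above, concepts rather than actors would take the place of the nodes.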

Other approaches proposed by sociologists similarly take off in fundamen- 
tal ways from structural characteristics of narrative. Abbott (1995), for in- 
stance, investigates the sequential organization of narrative structures in 
search of patterns of recurrent sequences. "The sequence of events has its own 
laws," wrote Propp long ago. "Theft cannot take place before the door is 
forced. Insofar as the tale is concerned it has its own entirely particular and 
specific laws. The sequence of elements ... is strictly uniform [emphasis in 
original]. Freedom within this sequence is restricted by very narrow limits 
which can be exactly formulated" (Propp 1968:22). Abbott & Hrycak's 
method of analysis allows investigators to bring out those uniform sequences 
(and not just in narrative) (Abbott & Hrycak 1990). 

Even more closely related to the structural characteristics of narrative is a 
method of analysis of text data proposed by Heise (Heise 1989; Corsaro & 
Heise 1990). Following the long philosophical and linguistic tradition of the 
difference in meaning between “things [that] happen because of one another, 
or only after one another," of dynamic and static motifs, of cardinal functions 
(or nuclei) and catalyzers, of kernel and satellite events, Heise developed a 
computer program, ETHNO, that forces investigators to make explicit the im- 
plicit assumptions built into causal arguments, as these arguments are reflected 
into the chronological sequence of skeleton narrative sentences. Griffin's 
analysis of the narrative of a lynching event that took place in Mississippi in 1930
(Griffin 1993) shows that Heise’s approach does help to tease out the implicit 
and explicit causal patterns of a narrative. Furthermore, the approach helps to 
bring out research questions and broad patterns of social relations by focusing 
on a single narrative. 

What is characteristic about these new techniques is that their real contribu- 
tion does not seem to lie so much in the methodological but in the epistemo- 
logical realm. As Abbott argues with reference to sequence analysis, sequence 
analysis is not just “a particular technique [of data analysis, but] ... rather a 
body of questions about social processes” (Abbott 1995:93). No doubt, a view 
of social reality fundamentally based on narrative data shifts sociologists’ con- 
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cerns away from variables to actors, away from regression-based statistical 
models to networks, and away from a variable-based conception of causality to 
narrative sequences. That view promises to bring sociology closer to history 
and to sociology's own original concerns with issues of human agency. It also 
blurs the line between quality and quantity, transcending the terms of a debate 
that has uselessly involved social scientists over the last 50 years (see Abell 
1987:3—12). But whether a technique is worth its salt (particularly when it 
claims to rise above the methodological realm and into the epistemological 
one) fundamentally depends upon the substantive products it delivers. And as 
of yet, the researchers involved in peddling these wares have delivered sub- 
stantive products of limited import (for some examples, see Abbott & Barman 
1997; Franzosi 1997a,b; Griffin 1993). More to the point, these researchers 
have been less than forthcoming in spelling out the methodological and epistemological
limitations of their techniques (see Franzosi 1999). Semantic grammars
tend to mix syntactical and semantic categories (e.g. subject and object
are syntactical categories—actor would be a better semantic alternative—but
action is a semantic category; verb would be the syntactical equivalent).
Furthermore, the processes that actors perform are broader than those
implied by the term action. The high costs involved in the collection of event 
data may lead researchers to focus on particularly transformative events (e.g.
“the red years" and “black years" of 1919–1922 of my work, or Tarrow's focus
on the cycle of protest around the Italian “hot autumn" of 1969; see Franzosi 
1997b, Tarrow 1989). The event is back, from the ashes of Braudel's attack 
and that of his colleagues in the Annales school. Finally, the very richness of 
the event data that computerized semantic grammars deliver may lead re- 
searchers to adopt descriptive modes of explanation, the narrative of the evi- 
dence imposing its form on the mode of explanation. In going “from words to 
numbers" we may have inadvertently gone from "thin explanations" (based on 
variables and regression models) to “thick descriptions." The application of 
network models to narrative data may not necessarily improve things on this 
score. Laumann wrote that “the hallmark of network analysis ... is to explain, at 
least in part, the behavior of network elements (i.e. the nodes) and of the system
as a whole by appeal to specific features of the interconnections among the
elements" (Laumann 1979:394). Such emphasis on explanation (rather than 
description) is certainly part and parcel of our disciplinary mottos and of the 
legacy bequeathed to us by our "founding fathers." But a focus on the system 
of interconnections alone is unlikely to lead us beyond the descriptive. 

With respect to Heise and Griffin's approach, their method leaves unre- 
solved the issue of how investigators reach fundamental decisions about what 
is sequential and what is consequential. For all the logical questions that 
ETHNO asks, for all its probing, the decision-making process itself remains a
"black box" hidden in the recesses of the human psyche. That being the case,
and given that ETHNO has been applied to understand short narrative passages,
why would a good linguist or a good historian not reach those same conclusions
without ETHNO? (For a critique of one of Abbott & Barman's contributions,
see Franzosi 1997c.)


AN EXAMPLE 


Consider the following text. 


Neville: After my wife kicked me out I spent several weeks living in my car.
Being homeless she wouldn't let me see my son ... I really missed Ricky. A 
friend suggested I go to see Shelter. I was a little apprehensive ... frightened 
to go in, but they were brilliant. It’s a bit like a hotel, it’s very clean and the 
staff are great. Best of all my wife came round to check the place out and now 
lets my son visit me, it's let me rebuild me family life... (Oxford Inde- 
pendent, 1997) 


It is not hard to recognize this text as narrative. The text deals with “the temporal 
character of human experience” (Ricoeur 1984:52), with a change in situation 
from bad (homelessness and breakdown of family life) to good (a place to live 
and the rebuilding of family life), and contains both narrative and non-narrative 
(mostly descriptive) clauses arranged in chronological order (see Table 1). 

Many temporal references in Neville's story highlight the role of narrative 
as "recapitulation of past experience" (Labov 1972:359). But time in narrative 
has a dual function: It “is constitutive both of the means of representation (lan- 
guage) and of the object represented (the incidents of the story). Time in narra- 
tive fiction can be defined as the relations of chronology between story and 
text" (Rimmon-Kenan 1983:44). In Neville's story, the sequence of clauses 
coincides with the sequence of the narrated events (the sequences of both 
clauses and events are the same; see columns 1 and 2 in Table 1). Story and plot 
coincide, with minimum plot development. That is rather typical of simple sto- 
ries. The plot is the realm where the narrative abilities of different authors can 
make something out of the basic raw material of a story (a sequence of events).
Narrative theorists have highlighted three different aspects of narrative 
time—order, duration, and frequency—each dealing with three different sets 
of questions: When? For how long? And how often?9

Neville's story does not provide enough clues for a clear answer to the first 
question: When? Although extranarrative information tells us that the story 


9The most comprehensive account of narrative time is Genette's treatment (Genette 1980).
Ricoeur’s three-volume treatment is more ambitious (Ricoeur 1984, 1985, 1988), but not as crisp as
Genette's. Ricoeur’s book deals with linguists’, philosophers’, and historians’ views of time. It
provides a comprehensive review of each author’s position in lucid, clear language. For brief
introductions to the issues, see Rimmon-Kenan (1983:43–58), Cohan & Shires (1988:84–89),
Toolan (1988:48–61).
Table 1  Narrative and non-narrative clauses in Neville's story*

Clause  Time  Text
(01)    T1    After my wife kicked me out
(02)    T2    I spent several weeks living in my car
(03)          Being homeless
(04)          she wouldn't let me see my son
(05)          I really missed Ricky
(06)    T3    A friend suggested
(07)    T4    I go to see Shelter
(08)          I was a little apprehensive...
(09)          frightened to go in
(10)          but they were brilliant
(11)          It’s a bit like a hotel
(12)          it's very clean
(13)          and the staff are great
(14)    T5    Best of all my wife came round
(15)          to check the place out
(16)    T6    and now lets my son visit me
(17)          it’s let me rebuild me family life

*The distinction between “narrative” and “descriptive” units in Table 1 is based on the linguistic
properties of the verb. Typically, the narrative characteristics of a story are linguistically
marked by the use of (a) finite (rather than non-finite) verbs (e.g. walks or walked, rather than to
walk) and (b) dynamic verbs that depict events and active processes, rather than stative verbs that
describe states of affairs or descriptions (Prince 1973:29, Chatman 1978:31–2, Toolan
1988:34–5, 266). “Process statements are in the mode of DO or HAPPEN ... Stasis statements are
in the mode of IS” (Chatman 1978:32). Yet, some static verbs in the mode IS hide process statements
in the mode of DO. A good example is clause 10, “they were brilliant,” which implies that
the Shelter staff DID something to make Neville feel welcome, or comfortable and at home.

Also, strictly speaking, not all the numbered text units are clauses (e.g. clause 4), although most of
them are. The discourse analysts’ terminology of “discourse units” may be more appropriate. I
have kept the term “clause” to emphasize Labov's separation of narrative and descriptive, or
“free,” clauses (Labov 1973).


was published in January 1997, it does not tell us when exactly the narrated 
events took place. The deictic element “now” temporally anchors the story to 
the moment of narration (rather than to a moment prior to that of narration),
but that narration could have occurred at any time in the past prior to
publication. The temporal shift from the past tense of most clauses in the story 
(“kicked out,” “spent,” “missed,” “suggested”) to the present tense of clauses 
11-13 is in line with the descriptive character of those clauses and with the 
static nature of those verbs. As for duration, we know that Neville spent several
weeks living in his car, but we do not know how long he has been living in
Shelter or how long his son's visits are. Finally, we do not know, for instance, 
how often Ricky visits his father Neville at Shelter. In clause 4, the aspect of 
the verb “wouldn't let me” indicates a repetition of the action (habitual action),
a denial protracted in time. 

The narrativists' duration and frequency do not just refer to the duration and 
frequency of the narrated (real-life) events. More generally, duration and fre- 
quency refer to the relationship between narrative clauses and narrated events. 
[On these points, see Genette (1983:33—160); Rimmon-Kenan (1983:43—58); 
Cohan & Shires (1988:84—89); Toolan (1988:48—62).] A narrator can sum up 
in one sentence events that took place over a long period ("After the fall of the
Roman empire..."), or dwell for many pages on fleeting events lasting a few 
minutes [Geertz's thick description (1973:3—30)]. Neville dedicates the same 
textual duration (one clause) to events of unequal temporal duration (“I spent 
several weeks living in my car" and “my wife came round”). Similarly, a narra- 
tor can recount the same event several times (frequency). In the film Ground- 
hog Day the events occurring in one day in the life of a TV news reporter are 
told several times as that day is relived over and over by the protagonist. 

The narrative “game with time" (Genette 1980:155) is by no means con- 
fined to purely fictional narrative. Claude Lévi-Strauss has argued that chro- 
nology is the distinctive characteristic of history: Without chronology 
("dates"), there is no history [Lévi-Strauss (1972:258—60); see also Barthes 
(1970); White (1987:1—25)]. But different periods of history are characterized 
by different densities of dates. There are "hot" chronologies such as World 
War II, where the historian closely follows the events day by day, hour by 
hour, and chronologies where the historian quickly jumps over long spans of 
thousands of years. This selection of dates and events (the “facts” of the histo- 
rian), these narrative strategies are not "innocent." They reflect the (conscious 
and unconscious) intentions of the historian; they serve a fundamental ideo- 
logical function (Barthes 1970; White 1987:35). That is true for even the sim- 
plest of historical work—the chronicle. Medieval chroniclers listed events se- 
quentially, year after year, one event per year, one year per line, a blank entry 
for many years. Furthermore, events varied in duration from "Pippin, mayor of 
the palace, died” (714 AD) to "Theudo drove the Saracens out of Aquitaine"
(721 AD) (White 1987:6—9). Yet, even this seemingly random and bizarre se- 
lection of facts—"712 flood everywhere; ... 722 Great crops; ... 725 Saracens 
came for the first time" (White 1987:7)— makes sense in light of the authors' 
fundamentally religious weltanschauung where only the “other world" counts 
and God's intentions for this world are inscrutable. 

The choice of "facts" in Neville's narrative and the lack of correspondence 
between narrative duration and the duration of narrated events perhaps point
to an ideological bias in Neville's narrative. Take Neville's dramatic entry into
his story in the first clause: “After my wife kicked me out...." The clause does 
not have independent status. It is "backgrounded" with respect to the "fore- 
grounded" main clause of the sentence, “I spent several weeks living in my 
car." (See Hopper 1979 on backgrounding and foregrounding.) As such, it 
downplays the event of being kicked out of the house, and up-plays the state of 
homelessness. Yet the clause is highly marked. It is the very first clause of the 
story. Neither in the first sentence nor in subsequent sentences does Neville tell 
the reader why his wife kicked him out. His narrative does not focus on the rea- 
son and causes of an action that has tragic consequences for his life. Rather, it 
focuses on those tragic consequences. Through careful backgrounding and 
foregrounding, through silence on causes and emphasis on consequences, 
Neville draws the reader's attention to Neville's point of view, rather than the 
wife's (indeed, we read “me,” “me,” “me,” “my,” “my,” “my” throughout the 
story). It provides a male's, rather than a female's reading of the events. 

The second sentence in the text is similarly introduced by a clause with no 
independent status: "being homeless." The participial form syntactically introduces
a causal argument, without, however, explicitly stating causation (“because
I was homeless”). As a result, subsequent actions are less clearly consequential.
The aspect (“wouldn't”) of the verb “let” indicates not only a habitual
response (i.e. throughout this period of Neville living in his car), but also a
volitional response (she did not want Neville to see his son, his son!). Backgrounding
and foregrounding of clauses in the second sentence make the actions
of Neville’s wife appear unmotivated and mean-spirited.

Using similar rhetorical devices of silence and emphasis, of backgrounding 
and foregrounding, the media provide ideologically biased readings of social 
relations (Eco 1971). In one of the few systematic analyses involving English 
mainstream and radical papers, Hartmann (1975/1976) shows that the Morn- 
ing Star, the daily newspaper of the British Communist Party, pays much more 
attention to the causes of industrial action than its consequences on the econ- 
omy or on the public. According to Downing, the “refusal to explain the roots 
of any strike, ... [and the focus] on its disruptive effects, ... [makes] the strikers’ 
decision all the more inexplicable and unforgivable” (Downing 1980:47). 

The research of the Glasgow University Media Group on British television 
coverage of strikes highlighted similar patterns. The TV news reporting of the 
1975 Glasgow garbage collectors' strike focused on the health hazard posed by 
the strike (the "effect" of the strike) with minimal attention paid to the reasons
for the strike (Glasgow University Media Group 1976:244, 249, 253). Mur- 
dock's analysis of media coverage of radical groups shows that those same 
mechanisms of systematic bias operate in the media against other groups 
(Murdock 1973). Murdock writes, "Attention is (then) directed away from the 
underlying issues and the definitions of the situation proposed by radical 
groups, and fixes instead on the forms which this action takes. The ‘issue’
therefore becomes one of the forms rather than causes" (Murdock 1973:157). 
His case study of a mass demonstration against the Vietnam War held in Lon- 
don on 27 October 1968 shows how media emphasize form over content, how 
"the underlying causes of why there was a march to begin with” are neglected 
(Murdock 1973:160). Van Dijk's work on the role of the Dutch media in the reproduction
of racism basically tells the same story with regard to ethnic minorities
(van Dijk 1987:210, 218). “Topics are dealt with in terms of ‘problems’
and from the point of view of the authorities” (van Dijk 1987:210). “The
causes or context of such problems are seldom analyzed in the press, and 
hardly ever explained in terms of White racism” (van Dijk 1987:218).

Careful backgrounding and foregrounding of information can certainly go
a long way in ideologically coloring a text. But that ideological color can also
be made quite explicit. According to Labov, the sequence of purely narrative 
clauses performs the referential function of narrative. Basically, that function 
deals with the question: What is the story? But narratives are also character- 
ized by a second function—the evaluative function—which deals with the 
question: Why is the story told? (What is the story's point? See e.g. Toolan 
1988:147.) A typical story will contain explicit evaluative statements that re- 
veal the teller's attitudes to the events recounted. Many of the narratives col- 
lected by Erikson, after the Buffalo Creek flood that wiped out an entire min- 
ing community, express sentiments of anger towards the Pittston Mining Com- 
pany: 

“We have bitterness toward the coal company.”

“They washed us out, killed all those people.”

“I don’t think they thought of us as human beings.”

“All I can call the disaster is murder.”

“The coal company knew the dam was bad, but they did not tell the people.”

“All they wanted was to make money.”

“They did not care about the good people that lived up Buffalo Creek...”


(Erikson 1976:183)


Indeed, the sense of moral outrage toward corporations and the profit mo- 
tive may well be the point of the stories of at least some of the survivors. But 
stories do not necessarily have only one point. The Buffalo Creek survivors, 
while blaming the company for such devastating tragedy, may also want to 
convey to city people who have flocked to Buffalo Creek after the disaster 
what it means to have lost not just someone close to kin, but a community: 

“If you had problems, you wouldn’t even have to mention it.”

“People would just know what to do. They’d just pitch in and help.”
“Everyone was concerned about everyone else.”

“Well, I have lost all my friends. The people I was raised up and lived with,
they’re scattered. I don’t know where they are at. I’ve got to make new
friends, and that’s a hard thing to do...” (Erikson 1976:190–91)
The more complex the story, the more likely that it will explicitly satisfy the
evaluative function with one or multiple story points. Children's ability to tell 
stories increases with age—simple chronological stories being characteristic 
of children less than four years of age. [For a brief review of children's narra- 
tive development, see Toolan (1988:193—202).] Between the ages of four and 
nine, children steadily improve their narrative competence: (a) The weight of 
the evaluative function increases at the expense of the purely narrative function,
the evaluative function being marked by specific evaluative clauses; (b) stories become
longer and plots more elaborate (older children present the events in the
plot in a different sequential order than in the story); (c) direct speech or free 
indirect speech, completely absent in younger children, becomes more com- 
mon with age; (d) main characters become “rounder,” to use Greimas’ expres- 
sion; (e) stories become more coherent and causal arguments more explicit. 

Neville's story presents none of the characteristics associated with high 
narrative competence. His story does not contain explicit evaluative clauses. 
That does not mean, of course, that there is no evaluation in Neville's account
of a particular moment of his life. As Skinner has argued, the process of production
of text is always inextricably linked to the production of ideology.10
Even the referential function of narrative—the chronological sequence of nar- 
rative clauses—is not devoid of evaluation. Purely narrative sequences are 
never innocent. Narrative sequences imply causal sequences. As Aristotle 
wrote in the Poetics, “It makes a great difference whether things happen because
of one another, or only after one another” (emphasis in the original)
(Halliwell 1987:42). Todorov made that point explicit: “Most works of fiction
of the past are organized according to an order that we may qualify as both
temporal and logical; let us add at once that the logical relation we habitually
think of is implication, or as we ordinarily say, causality” (Todorov
1981:41).11 Certainly, Neville’s story suggests (or is at least compatible with)
the following causal proposition: Because a wife kicks out a husband, the hus- 
band ends up homeless. There is some truth to that. In a survey of three differ- 
ent populations of single homeless people conducted in England in 1991, the 
breakdown of a relationship is the main reason for homeless persons leaving 
their last home (Anderson et al 1993:70–73).12 But that, of course, is dodging
the question: Why did Neville's wife kick him out in the first place? After all, 
the in-depth study of 30 newly separated mothers in the Nottingham and Derby 


10See the essays collected in Tully (1988).
11On the relationship between causality as chronologically ordered sets of propositions, see
also Rimmon-Kenan (1983:17–19), Cohan & Shires (1988:17, 58).
12... does not in itself explain why people who have left their homes become homeless. In
Britain this is primarily due to a shortage of affordable housing, so that people in low-paid jobs, the
unemployed, and those with mental health, alcohol or drug abuse problems are at risk of
homelessness if they lose their accommodation following the breakdown of a relationship.
areas shows that the most frequently reported reason for the breakdown of a relationship
is the man's violence and adultery (Leeming et al 1994:19). Again,
silence on causes and emphasis on consequences shifts the blame away from
Neville and towards his wife.13

The implicit causal reading of Neville's narrative does give us some clues
on how to interpret his story and his intentions—the story point. Perhaps Nev- 
ille is suggesting that women are to be blamed for men's misfortune. The text, 
then, is a political manifesto against women. Both the causal arguments im- 
plicitly built in Neville's narrative and the subtle backgrounding and fore- 
grounding of information point to that reading of the story point. Further evi- 
dence for that interpretation of the text comes from character traiting—the 
positive or negative portrayal of actors—and the attribution of actions to ac- 
tors. There is minimum characterization of dramatis personae in Neville's 
story.14 With the exception of Shelter, all actors (Neville himself, the wife,
his son Ricky [indeed, always referred to as “my” son], the friend, Shelter
staff) are human. Interestingly enough, and perhaps no differently from fairy
tales where the giver/sender in Greimas’ model is often nonhuman and endowed
with magical powers, the giver/sender is not a human being but an organization
(Shelter). It is adjectives that provide basic information on character
traiting (Todorov 1969:31).

For Chatman, “A trait may be said to be a narrative adjective ... labeling a 
personal quality of a character" (Chatman 1978:125). In qualifying the word 
"adjective" with the word "narrative," Chatman is stressing the fact that char- 
acter traiting (or the adjectival function of narrative) does not necessarily fall 
on adjectives exclusively. Again, there is minimum use of adjectives in Ne- 
ville's narrative for the purpose of character traiting. Of the five adjectives 
found in the text (apprehensive, frightened, brilliant, clean, great), however, 
two (apprehensive and frightened) are used by Neville to portray himself and 
the other three (brilliant, clean, and great) are lavished upon Shelter. While the 
first two adjectives describe negative psychological and emotional states, the 
other three are either hyperbolic (brilliant, great) or, when they are simply 
positive (clean), they are strengthened by a quantifier (very). Again, Shelter 
emerges as the real deus ex machina in Neville's narrative. In fact, setting him- 


13The issue of narrative truth is not a linguistic problem. It becomes a problem when narrative is
used to index social reality. It is a problem for historians. For Ranke, one of the “founding fathers”
of the historical profession, history must relate things the way they actually happened (wie es
eigentlich gewesen), the historians’ “noble dream” of objectivity (Novick 1988). It is a problem
for ethnographers who draw broader inferences from informants’ narratives, which are always
subject to selection, distortion, interpretation, evaluation (for a brief introduction to the issue of
narrative truth, see Kohler Riessman 1993:21–23).

14We owe to Greimas the distinction between flat and round characters, characters with 
minimum characterization and characters presented with a relative degree of complexity and depth 
(Rimmon-Kenan 1983:40, Bal 1977:31).
self up as a “little guy” (frightened and apprehensive) helps Neville build up
Shelter, by contrast. 

In the absence of explicit character traiting, we can also infer character from 
action (Rimmon-Kenan 1983:60—61; Toolan 1988:102). Thus, the dramatic 
opening of Neville's story "After my wife kicked me out" implicitly points an 
accusatory finger towards the wife. After all, as far as we know, Neville did 
nothing to deserve this. In his narrative, Neville does not volunteer any infor- 
mation on the reasons why he was kicked out of the house. We are left to imag- 
ine: Because he abused his wife, because he refused to carry any responsibility 
around the house, because he had affairs, because he was a drunk or a drug ad- 
dict... In fact, the violence perpetrated against Neville in the opening 
clause—violence all the more senseless and gratuitous because it has no (nar- 
rative) explanation—helps to bring Neville into focus as a victim rather than a 
villain. We feel sorry for and sympathize with victims, while we are repulsed by
villains. Further clues help to conjure up positive images about the moral fiber 
of Neville: after all, a man who ("really") misses his son, who has strong aspirations
to family life, who appreciates cleanness, and who has friends.15
Rather than a brazen and insolent villain, this narrative presents before us a 
man deserving of our pity: homeless, harmless, and deeply hurt in his paternal
feelings (if not in his masculinity altogether). And so it is. We do extend
him our pity. At which point, with the reader on his side, Neville, in the closing 
lines of his narrative, can even afford a more sympathetic portrayal of his wife. 
Again, characterization is not direct; we have to infer character from action. 
But the actions of Neville’s wife in the last clause point to a caring mother, 
who, having the well-being of her son at heart, comes around to check Shelter, 
and then lets Ricky visit his father. 

Whatever Neville's intentions, no doubt, the wife's actions associate her with
a strong character: She kicks out, she would not let (combining volition and 
permission), she comes around, she checks, she lets. Indeed, these are actions 
of authority and power. By contrast, both directly and indirectly, Neville 
comes across as a weak character. Directly, Neville describes himself as appre- 
hensive and frightened. Indirectly, we can infer who Neville is not only from 
what he did, but from what he could have done and did not do. Neville appears 
to passively accept his wife's authority and power. His story offers no signs of 
protest on his part to his wife's most damaging decisions: of kicking him out of 
the house and of not allowing him to see his son. He could have kicked the wife 
out. He could have argued and screamed, perhaps even resorted to physical 
violence. He could have run away with his son. He could have engaged in a legal
battle with his wife. 


15Rossi's data on Chicago homeless show that homelessness does not necessarily imply
complete loss of friendship (or family) ties (Rossi 1989:173–7).
He could have, but he did not. Why not? “The human capacity to withstand
suffering and abuse is impressive," writes Barrington Moore (1978:13) in his 
Injustice: The Social Bases of Obedience and Revolt. It certainly helps if the 
sufferers believe in the fundamental justice of their suffering. [See Barrington 
Moore's discussion on responses of moral outrage or moral submission to in- 
justice (Moore 1978:49-80).] Does Neville believe that he deserves having 
been kicked out of the house? And does he hold that belief because his wife's 
action was prompted by something he did and that he himself perceives as 
morally wrong—a breach of the social contract—or because he "take[s] pride 
and pleasure in ... [his] pain" (Moore 1978:50) and suffering, like religious as- 
cetics or Indian Untouchables? 

We do not know the answer to those questions. But the text does present 
an interesting reversal of stereotypical gender roles of male aggressiveness 
and female passivity. That is particularly true if we make the reasonable as- 
sumption that Neville is a member of the lower classes. As Bourdieu shows, a 
conception of masculinity in terms of toughness, virility, physical strength 
(with "enormous, imperative, brutal needs") is much more likely among 
working-class men (Bourdieu 1984:192; also 190, 382). There is nothing in 
Neville's traiting of his wife's decisiveness. Being apprehensive, frightened, 
and generally hesitant (as underscored by the choppy character of the narra- 
tive at this point—see the dots which imply a temporal pause in Neville's talk 
in clause 8) is not a traditional masculine trait. When considering that in Nev- 
ille's story the wife is the prime mover, the initiator of a temporal sequence 
of clauses that, in the end, sees Neville in a homeless state, perhaps Neville is 
telling his fellow readers that his wife has emasculated him, broken him 
down as a man. That is something, of course, that no patriarchal society can accept.
Peasant communities across Europe for centuries collectively "took
care" of offenders of traditional community norms through public rituals 
known as “rough music" or charivari (Thompson 1993:467—538). Women 
who broke patriarchal values (the scold, the husband-beater, the
shrew) were a prominent target in these rituals (Thompson 1993:476, 493). In
an 1838 description: 


When any woman, a wife more particularly, has been scolding, beating or 
otherwise abusing the other sex, and is publicly known, she is made to ride 
stang. A crowd of people assemble towards evening after work hours, with 
an old, shabby, broken down horse. They hunt out the delinquent ... and 
mount her on their Rozinante ... astride with her face to the tail. So they parade
her through the nearest village or town; drowning her scolding and
clamour with the noise of frying pans &c.... (Thompson 1993:499) 


Whether Neville believes his wife's actions to be just or unjust, the lack of 
resources would certainly put a limit to the available courses of action. If Nev- 
ille were poor he could hardly engage his wife in a protracted legal battle over 
their son. Gamson et al (1982:60–64) argue that the range of possible responses
to unjust authority varies from compliance to evasion, rim talk, resistance,
direct action, and preparation for future action. But whether social actors
choose one form of reaction or another in their encounters with unjust
authority ultimately depends upon the availability of resources. Scott's (1985) 
study of Malaysian peasants makes that point quite clear: The relative lack of 
resources does not allow the peasants to take overt reactions. “Most forms of
this struggle stop well short of outright collective defiance. Here I have in mind 
the ordinary weapons of relatively powerless groups: foot dragging, dissimu- 
lation, desertion, false compliance, pilfering, feigned ignorance, slander, ar- 
son, sabotage and so on” (Scott 1985:xvi). Newby, in his study of East Anglian 
peasants, similarly noted: "The agricultural worker has acknowledged his 
powerlessness and decided to make the best of his inferior situation, contriving 
to take it somewhat for granted while not necessarily endorsing it in terms of 
social justice" (Newby 1979:414). 

We do not know Neville's thoughts on the justice/injustice of his wife's 
actions. But we can at least ask the question: What resources does Neville 
command? More generally, what do we know about Neville? Who is Neville? 
In one of Grimm’s classical fairy tales, “Roland and Maybird,” the parents of 
the two children abandon them in the thick of the woods. Cleverly, Roland 
leaves behind a trail of white pebbles that will help him find the way out. Nev- 
ille has also left behind a trail of clues that help us understand who he is. We 
know that Neville is likely to be a young man. After all, Ricky, his son, is still a 
little boy in need of his mother's permission to see his father. Perhaps, he is 
also white. The majority of homeless people surveyed by Anderson et al in 
England were white, male and in the age group 25—44 (Anderson et al 
1993:7—9). 

We also know that Neville is likely to be poor. If he were not, he could have 
moved into a hotel or into a place of his own after being “kicked out.” There is 
no mention of any financial transactions between Neville and his wife. Is Nev- 
ille contributing child support? Unlikely, in his current situation; in fact, un- 
likely in any situation from what we know of divorced men's contributions to 
child support. Most ex-partners provide no child support to newly separated 
mothers (Leeming et al 1994:55). 

We do not know whether Neville is also unemployed, although he is quite 
likely to be. In the 1991 English survey of homeless by Anderson et al, fewer 
than 10% of the homeless people surveyed were employed (Anderson et al
1993:15–16; see Rossi 1989:135 for similar data on Chicago homeless). But
if he is gainfully employed it is likely to be in a low-pay, low-status occupa- 
tion. How could he hold onto a more demanding job while living for weeks in a 
car? Rossi's data on Chicago homeless make it clear how difficult it must be to 
keep up appearances while living in the streets: Only 27.5% of the respondents
living in the streets appear “neat and clean” to the interviewer vs 70.5% of
those living in shelters (Rossi 1989:93). In England, according to the survey 
conducted by Drake et al, 65% of the currently unemployed homeless were 
semiskilled or unskilled in their last job (Drake et al 1982:26, 29). As for their 
current job, when the homeless do work, they do so prevalently in low-pay, 
low-status, casual jobs (Anderson et al 1993:15—16).16 

Not only the facts of Neville's story tell us something about his social back- 
ground. Purely linguistic markers also help to identify Neville as a member of 
speech communities more typically found among the lower ranks of British society.
There is some evidence of a “women's language” (Lakoff 1975) or, more
generally, a “powerless language" (O'Barr & Atkins 1980). [However, see the 
strong critique by Coates (1993:132—35).] That language is characterized by: 
(a) hypercorrect grammar; (5) higher frequency of accompanying gestures, 
hedges (“well,” “I guess"), and intensifiers (e.g. “so,” “very”); (c) “empty” ad- 
jectives (“‘divine,” “cute,” “charming,” “ghastly”); (d) polite forms (“would you 
please ...?”); (e) rising (question-type) intonation in declarative sentences; (f) 
greater pitch range and more rapid pitch changes; (g) tag questions ("isn't it?”’) 
(Lakoff 1975:53—57; O’Barr & Atkins 1980; Toolan 1988:250; Coates 1993: 
132-33). Linguistic research on courtroom transcripts has shown that defen- 
dants and witnesses with professional qualifications or positions of authority 
tend to use a language of power (see Toolan 1988:249—55 for a review). Fur- 
thermore, those defendants and witnesses who use a language of power tend to 
be perceived by judges, juries, and lawyers as more knowledgeable, more 
credible, and more trustworthy than those who use a powerless language. 

In Neville's language we find many of these markers of powerless language.
We find hedges (the dots in clause 8 are a marker of hesitation and uncertainty,
further underscored by the adjectives “apprehensive,” “frightened”),
empty adjectives (“brilliant,” “great”), and intensifiers (“really missed Ricky”
in clause 5; “very clean” in clause 12). What Neville finds attractive about
Shelter (“clean” and “like a hotel”) is also more typical of working-class individuals.
In his study of taste, Bourdieu provides quantitative evidence on the
relationship between socioeconomic background and images of an “ideal
home” (Bourdieu 1984:247–48). “The proportion of choices emphasizing
overtly aesthetic properties (studied, imaginative, harmonious) grows as one
moves up the social hierarchy, whereas the proportion of ‘functionalist’
choices (clean, practical, easy to maintain) declines” (Bourdieu 1984:247).

The text does not provide any information on posture, gestures, gaze, or 
voice pitch. That is unfortunate, because conversation analysts and ethnomethodologists


16The most common form of income for homeless people was Income Support or other state
benefits with an average total income of £38 per week (Anderson et al 1993:15–16). On the income
of homeless people in the US, see also Rossi's data on the Chicago homeless (1989:136).

have shown how telling those extralinguistic characteristics can be
(Maynard 1992; see also Coates 1993:188, 197–98; Kohler Riessman
1993:40).17 Silences may speak even louder than words, as shown by the work
by West & Zimmerman (West & Zimmerman 1980, 1983; see also Coates 
1993:107—14) on gender differences in patterns of silences and interruptions in 
everyday conversations. In a careful analysis of interactions between doctors, 
patients, and family in the delivery of bad news, Maynard writes: “We must 
pay attention to the full sociolinguistic environment surrounding the bearing 
of bad news" (Maynard 1992:118). The evidence from Neville's text on hyper- 
correct grammar is mixed. If anything, the text contains several vernacular 
forms. “Me family life" in clause 17 should be “my family life.” In “Being 
homeless, she wouldn't let me" (clauses 3 and 4), the semantically implied 
subject of the first clause is “I” but syntactically it is “she.” Coates argues that
hypercorrect language is more characteristic of women than men (Coates 
1993:66—86); if anything, working-class men have a linguistic preference for 
nonstandard, vernacular language. That being the case, Neville's use of ver- 
nacular language is quite characteristic of both his gender and class. Not so 
characteristic, however, is Neville's use of formal adjectives (e.g. apprehen- 
sive) and of complex syntactical constructs (e.g. clauses 1 and 3 and the adver- 
bial clause 14). Neville's text seems to contain a peculiar mix of lowbrow and 
highbrow linguistic codes. 

Notwithstanding some contradictions in Neville's language, facts and lan- 
guage by and large go hand-in-hand in Neville's story, to project an image of 
Neville as a powerless character. Interestingly enough, for all the strength of 
Neville's wife as a character, for all the power she wields (by contrast), ulti- 
mately it is not up to her to reverse Neville's fortune. It is Shelter! Although 
Neville's wife is the grammatical subject (and the agent) in clauses 14, 15, and 
16, the subject of the very last clause in that sentence (clause 17) is "it" (pre- 
sumably Shelter, the only “it,” i.e. the only non-human and inanimate charac- 
ter in the story with agentative characteristics). It is it—Shelter—that “lets” 
Neville rebuild his family life. In fact, in the very last clause of Neville’s narra- 
tive, Shelter has replaced the wife as the agent we had come to expect as being 
associated with the verb “let.” 

It is that sudden and awkward transition in grammatical subjects in the con- 
joined clauses of the last clause that may leave the reader wondering whether, 
when Neville says “it’s let me rebuild me family life,” he does not mean, in 
fact, that he has moved back into the house. Neville does not tell us that. But


17Written text, of course, does not provide any information on accent and pronunciation.
Sociolinguists have shown that, perhaps even more than any other linguistic markers,
pronunciation can offer invaluable clues about the social background of individuals (Coates
1993:61–86; Romaine 1994:69–75).

neither did he tell us that he went to see Shelter. All he told us in clauses 6 and 7
was “A friend suggested I go to see Shelter.” From there, Neville jumped directly
to Shelter. There is a “gap” in the text that the reader fills (“I went to see
Shelter”).18 Is Neville introducing another similar gap at the very end of his
narrative? Only the deictic element “now” in clause 16 excludes the possibility
that Neville has moved back with his wife. “Now” anchors the text to a concrete
spatial and temporal situation (now = Neville at Shelter = rebuilding family
life inside Shelter), which may shed some new light on the story point.
Could Neville's narrative be just an advertisement for Shelter? The adjectival 
traiting of Shelter, the role of Shelter as deus ex machina of the narrative, and 
the ambiguity of the last clause certainly point in that direction. In that sense, a 
more explicit reference to Neville's return home in the last clause would have 
taken the referential focus away from Shelter, something no advertisement can 
afford. 

To bring Neville's story into sharper focus as, perhaps, an advertising text, 
consider the following narrative: “You were leading a miserable life, without 
any friends and anyone to go out with, all because you had dandruff. But now, 
thanks to Best Shampoo, dandruff is gone and you have plenty of friends and 
dates.” Best Shampoo (or Shelter) offers the magical solution to one's problems.
When viewed that way, Neville's reversal of fortunes from bad to good
brought about by the intervention of Shelter conforms quite closely to Grei- 
mas’ basic narrative model: A subject (typically the hero) strives to win over 
an object (a beautiful princess) against the opposition of a villain but with the 
help of a friend or relative (helper) and the magic intervention of a sender [a 
sort of superhelper in Toolan’s words (Toolan 1988:93)]. In Neville’s story, 
Neville is acting as the subject missing his son Ricky (object) whom he cannot 
see because of his wife’s interdiction (villain); with the help of a friend 
(helper) Neville gets in touch with Shelter (sender), which will allow Neville
to attain his goal. Vestergaard & Schroder (1985:27—32, 94) showed that such 
a model is indeed one of the typical models of advertising narratives, where the 
advertised products take on Greimas’ actant role of giver/superhelper. “Adver- 
tising ... does not try to tell us that we need its products as such, but rather that 
the products can help us obtain something else which we do feel that we 
need”—health, success, friends, and the like (Vestergaard & Schroder 
1985:29). 
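
The role assignments just described can be restated as a small data structure. The sketch below is only an illustration under stated assumptions: the `Actants` class, its field names, and the `superhelper_line` function are hypothetical labels invented here, not part of Greimas' or Vestergaard & Schroder's own apparatus. The values are the ones given in the text for Neville's story and for the dandruff narrative.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Actants:
    """Actant roles of the basic narrative model as summarized in the text.

    The class and field names are hypothetical labels chosen for this sketch.
    """
    subject: str                  # the hero who strives for something
    object_: str                  # what the subject wants to attain
    villain: str                  # what stands in the way
    sender: str                   # the "superhelper" whose intervention is decisive
    helper: Optional[str] = None  # friend or relative, if the story names one


# Role assignments for Neville's story, as given in the text.
neville = Actants(
    subject="Neville",
    object_="seeing his son Ricky",
    villain="his wife's interdiction",
    sender="Shelter",
    helper="a friend",
)

# The dandruff narrative names no helper, so that slot stays empty.
shampoo = Actants(
    subject="you",
    object_="friends and dates",
    villain="dandruff",
    sender="Best Shampoo",
)


def superhelper_line(story: Actants) -> str:
    """Render the advertising reading: the sender makes the goal attainable."""
    return f"Thanks to {story.sender}, {story.subject} can attain {story.object_}."


print(superhelper_line(neville))
print(superhelper_line(shampoo))
```

Filling the same slots for both narratives makes the parallel visible: Shelter and Best Shampoo occupy the identical position, which is the basis for reading Neville's story as an advertising text.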

In many advertisements, the power of the magic superhelper to bring about 
change in one's life is dramatized via the use of colors: The before is typically
in black and white, the after in color—the temporal framework of the technical 


18Gaps can be introduced to speed up the reading process (as in this case) or to arouse the
reader's curiosity. On the linguistic concept of “gap” see Rimmon-Kenan (1983:127–29) and Perry
(1979).

development of photography providing the simple temporal reading of the advertising
narrative. Thus, in the advertisements for Virginia Slims cigarettes,
the picture typically contrasts a black-and-white, small frame of a woman's 
life in the old days—a life of drudgery and hard work, particularly when com- 
pared to a man's life—to a larger and colorful frame of a woman's life now, a 
life of success and independence, as underscored by the fact that she can freely 
smoke (!).

There are no explicit references to colors in Neville's narrative. Yet, consider
the role of the adjective “brilliant” in clause 10. Coming after the images
of gloom and doom conjured up in the reader's mind by Neville's description 
of his homelessness, the adjective “brilliant,” although used in reference to 
Shelter staff, floods the narrative space with bright light, ultimately opening a 
glimmer of new hope in Neville’s life. The before (gloom and doom) and after 
(“brilliant”) are starkly separated by the use of that adjective of light. The before
and after are further underscored by the sudden shift in the use of verb
tenses from the past tense to the present tense of clauses 11 through 13 (it's ... a
hotel; it's ... clean; ... are great).

If Neville’s story is nothing but an advertisement for Shelter, then perhaps 
Neville is a fictional character. He is not the author of his story. There is a real 
author behind the story. Chatman introduced different levels of narrators and 
narratees [Chatman (1978:151); see also Rimmon-Kenan (1983:86—89); Co- 
han & Shires (1988:89—94); Toolan (1988:76-80)]: 


real author — implied author — (narrator) — (narratee) — implied reader — 
real reader 


But that real author has hidden behind Neville, leaving no explicit markers of 
his/her role—after all, what Shelter can do for the homeless is all the more 
credible if it comes from a real character. (On verisimilitude, see Chatman 
1978:48—53.) 

And that quest for realism leads to a muffled authorial voice. A muffled 
authorial voice is a purely rhetorical device. It is the same device that authors 
normally use in scientific writing. To achieve objectivity, authors silence their 
authorial voice. Objectivity is “absence of clues to the narrator ... [a] referen- 
tial illusion” (Barthes 1970:149) achieved by suppressing the emotive and co- 
native functions of language and emphasizing the referential function (Jakob- 
son 1960:357). In scientific writing, the authors’ direct intervention is minimal 
if not nonexistent. Historians have worked hard at denying themselves a voice. 
Their narrative mode of writing puts them dangerously close to narrative fic- 
tional writers—dangerously close, that is, for anyone expected to tell a real 
rather than fictional story and, thus, to be more than a mere storyteller. “A nar- 
rator is no longer present [in historical writing]. ... The events appear to narrate 
themselves” (Benveniste 1971:206–8; see also Cohan & Shires 1988:93). Historical
discourse is “a discourse without ‘Thou’.... In historical discourse
destination-signs are normally absent” (Barthes 1970:148).

Yet, the author of Neville's text has only partially succeeded in this. Just 
like Neville himself, the “real” author has left behind clues of his/her presence. 
The many gaps in the narrative suggest that, perhaps, the author was trying to 
compress the text, to summarize it. The beginning of Neville’s story is strongly 
marked (i.e. unusual). We do not typically start a narrative in the form: “After 
my wife kicked me out.” Rather, we might say: “This and that happened, then 
my wife kicked me out.” “A friend suggested I go to see Shelter” would be fol- 
lowed by “so I went.” Truly direct speech (by Neville) would also contain 
many utterances of the type “ehm,” “well,” “you know,” “so ... then” commonly
found in colloquial discourse.19

The real author has also introduced linguistic expressions that are atypical 
of colloquial direct speech, particularly for low-status individuals. The complex,
syntactically subordinate clauses 1 and 3 and the adverbial clause 14
seem out of character. The adjective “scared” would have sounded more realistic
in Neville's mouth than the formal synonym “apprehensive.” “I lived in
my car for several weeks” would have been more likely than “I spent several
weeks living in my car.” “Because I was homeless,” or “I did not have a home
... so ... my wife wouldn't let me see my son” would have replaced the hypotactic
“Being homeless.” Perhaps Neville's text is the result of an educated writer
trying hard to be Neville-like, to imitate the speech of a homeless person by 
putting words into the mouth of an imaginary Neville, or the result of that same 
type of writer tampering with the narrative (e.g. an interview) of a real Neville. 
The apparent contradictions noted above in Neville's mixture of both powerful
and powerless language find a plausible explanation when we take into consid- 
eration issues of authorship. 

Both a structural analysis of Neville's story and one that pays close atten- 
tion to the linguistic nuances of the text have helped us shed light on the story 
point. The story may simply be an advertisement for Shelter rather than a male 
manifesto against women (or both). Indeed, the double reversal of a situation 
in Neville's narrative seems to support both story points. Neville's being 
kicked out of the house by his wife marks the first reversal, the first disruption 
of an equilibrium, the worsening of a situation characterized by homelessness 
and loss of family life, in a word, the beginning of tragedy. Neville's moving 
into Shelter marks the second reversal, the improvement of the situation that 
ultimately leaves the reader on a happy-ending note. Neville's wife plays a 
crucial role in this first reversal of fortunes, good→bad, which may lead to a


19That could be the result of poor transcribing, however, wherein an inexperienced interviewer
has cut out many of the speaker's colloquial utterances. For a basic introduction to the problems of
transcribing ethnographic text, see Kohler Riessman (1993:56–60).

reading of the story as a male manifesto. Shelter, the organization for the
homeless, plays a pivotal role in the second reversal, bad→good, which may
lead to a reading of the story as an advertising text. But in this double reversal, 
the first reversal is backgrounded and the second foregrounded. By parallel- 
ism, Neville's story foregrounds its advertising role and backgrounds its role 
as a male manifesto. 

Perhaps if we had more information on the context of Neville's narrative we 
might be able to draw firmer inferences on the story point. Under what condi- 
tions was the text produced? All we know, besides Neville’s story itself, is that 
the story comes from The Oxford Independent. That information excludes a 
range of options as a source of the excerpt. For sure, the narrative does not 
come from a social science investigation of the type that Elliott Liebow (Tell 
Them Who I Am) (Liebow 1993) and Peter Rossi (Down and Out in America) 
(Rossi 1989) have masterfully put together on the subject of homelessness. 
Particularly in Liebow we find many excerpts of Neville’s kind: “One day I 
was a productive and respected citizen, the next day I was dirt,” says Shirley to 
Liebow (Liebow 1993:217). Too close for comfort? Sandford, the author of 
Down and Out in Britain, put it pungently:


I descended into the bilges of society. Wearing boots that gaped at the seams 
and an ancient great-coat, I allowed my beard to grow and my hair to become 
matted with dirt. I wanted to meet and talk with down and outs.... And I 
sensed a feeling of deep insecurity, the deep fear that I too might all too soon 
end up, derelict, on Britain’s skid-row.... (Sandford 1971:9) 


Not surprisingly, sociologists (and social scientists in general) have tried 
hard to put some distance between themselves and the people they study, both 
in their methods (e.g. surveys on the basis of questionnaires not directly ad- 
ministered by the “scientist”) and in their language (e.g. talking about respon- 
dents as the “object” of study). (For a feminist critique of this last point, see 
Graham 1984.) What is of interest here is how science underscores objectivity 
at the level of language through the systematic elimination of any signs (a) of 
the subjectivity of authorship and of a direct relationship between author and 
reader (Barthes & Jakobson’s “referential illusion”), and (b) of any relation- 
ship between the real authors (the “scientists”) and their subjects (turned into 
objects) of inquiry (the double illusion of science). The development of a spe- 
cialized jargon to express common concepts is another effective distance de- 
vice (Lakoff 1975:53—57). 

The name The Oxford Independent seems to suggest that the source of the 
excerpt is a newspaper. After all, The Independent is one of England's quality 
papers. The Oxford Independent could just be a local paper based in Oxford. If
so, what kind of newspaper is The Oxford Independent? A daily, weekly,
monthly? The fact that the issue is numbered (Number 12) and the fact that reference
is made to a month (January), rather than a specific day (e.g. 1/3/1996)
seem to point to a monthly publication. Perhaps even more telling is the year
of publication of the article (1997). Together with the place of publication (Oxford,
England), this information helps to contextualize Neville's misfortune.
As it turns out, Neville was far from being alone in his homeless status in those 
years. Although it is no simple matter to get reliable estimates of the number of 
homeless people, it is widely acknowledged that (a) the number increased significantly
throughout the 1980s and early 1990s in England, and that (b) homelessness
became more visible, with makeshift beds in subways, shop doorways,
and parks.20

Between 1975 and 1985, and then again in the early 1990s, the rate of un- 
employment soared despite changes in definitions (Gallie & Marsh 1994:3). 
On that score, England followed a path similar to that of other industrialized 
nations. What was more peculiar to England was that the policies of the 
Thatcher governments were systematically dismantling the welfare state, with 
devastating consequences for those living on the edge of poverty. As the poor 
were getting poorer, the richer were getting richer. Income inequality sharply 
rose in the United Kingdom from the late 1970s onward, reaching levels sec- 
ond only to those of the United States in the industrialized world (Johnson & 
Webb 1993; Atkinson 1997). 

Why was The Oxford Independent running Neville’s story? Could the ex- 
cerpt be part of a larger reportage on homelessness? After all, one of the most 
compelling accounts of homelessness came from the English journalist and 
writer Jeremy Sandford in his book Down and Out in Britain, which inspired 
Rossi's own title. Alternatively, was the story part of a single article on home- 
lessness? Since human-side stories are typically run around Christmas time, 
the date of the story (January) seems to confirm that assumption. 


THE ROLE OF THE READER 


On that scene of last-minute Christian pity, let us take leave from Neville and 
his world—the microcosm of his personal life and the macrocosm of British 
society at the turn of the second millennium. I could have said more, no doubt, 


20It is difficult to quantify precisely the extent of or increase in homelessness. The 1991 Census
provided the first official record with 2,827 homeless people and 19,417 hostel dwellers (Office of
Population Censuses and Surveys 1993). Occasional surveys suggest higher figures of 5,000–6,000
homeless (Shelter 1991) and 50,000 hostel dwellers (Connelly & Crown 1994:15). Other official
statistics exclude “single” homeless people, because single homeless do not qualify for housing
under the terms of the 1977 Housing and Homeless Persons Act, the 1985 Housing Act Part III, and
the 1996 Housing Act. Indirect measures also provide an indicator of the increase in homelessness.
For instance, between 1971 and 1986 there was a seven-fold increase in housing applications from
homeless people according to official statistics (Shanks & Smith 1992:35). The number of
households statutorily accepted as homeless has also increased, from 63,013 (1978) to 111,757
(1985) to 169,966 (1992).

but my page limit is up. But within those limits, I hope to have given the reader
enough token evidence on why (and how) sociologists should be interested in 
narrative. The analysis of Neville's story has brought out a wealth of both lin- 
guistic and sociological information. 

“The analysis of Neville’s story,” but, really, come now! The analysis has 
brought out nothing; Roberto Franzosi has, in reading Neville’s narrative— 
one time, two times, many times, “with steadily greater selectivity and atten- 
tion ... [and forming] tentative hypotheses about the nature and intention of 
whatever was noted” (De Beaugrande 1985:55), I have brought out that wealth 
of both linguistic and sociological information.21 And you, the reader, will
have surely done the same, refracting Neville’s story onto my story and ulti- 
mately building your own story. 

Whatever else the analysis of Neville’s story may have shown, one thing it 
has clearly shown: Our ability to understand and fully grasp the meaning of 
even such simple text as Neville’s story is inextricably linked to a wealth of 
background knowledge that readers consciously or unconsciously bring to the 
text in the construction of meaning. “No knowledge without foreknowledge,” 
the hermeneutics scholars maintain (Diesing 1991:108). Our ability to read 
Neville’s story as an advertising text fundamentally depends upon our “foreknowledge”
of advertising codes.22 Our ability to pick up the deeply ingrained
male viewpoint in Neville's story depends upon our linguistic competence in 
teasing out meaning embedded in language nuances. Finally, our ability to go 
beyond Neville's microcosm depends upon our knowledge of the social rela- 
tions of his macrocosm (the interplay between text and context). 

Perhaps, when reading Neville's story, the average reader will not think of 
Culler, Liebow, Rossi—the linguistic and sociological paraphernalia of my 
own text. Eco distinguishes on this point between a “naive” and a “critical” 
reading of a text, "the latter being the interpretation of the former" (Eco 
1979:205). There is never a single message uniquely encoded in a text; there 
are several messages (“a network of different messages”) as decoded by different
readers endowed with different “intertextual frames” and “intertextual encyclopedias,”
and different reading codes (Eco 1979:5).


21Indeed, according to hermeneutics scholars, text comprehension is an iterative process (the
“hermeneutic circle”): A reader will (a) approach a text with some hypotheses in mind about the
text; (b) search for evidence on those hypotheses in a reading of the text; (c) set up new hypotheses
about meaning; (d) restart the reading process (Diesing 1991:109, 121). On the text and its reading,
see also Rimmon-Kenan 1983:117–29, in particular 119–22, and Cohan & Shires 1988:114–33; on
text reading as a system of hypotheses, see Perry 1979:43 and Culler 1975.

22“Code” is the concept used by Barthes in his S/Z (Barthes 1990); Eco talks about “intertextual
frames” and Perry of “frames” (Eco 1979; Perry 1979:36). This process of assimilating the meaning
of a text through the familiar is called “naturalization” by Culler (1975:138).

The role of the reader is far from passive. Narrators and authors, of course,
may try to build a “preferred” reading of their texts. Neville’s story is undoubt- 
edly told from his point of view. For one thing, Neville is the narrator of the 
story. More to the point, Neville tells the story from his perspective. We know 
how Neville feels and what he has had to go through. But we do not know how 
his wife feels, the difficulties she had to face as a single parent. Had the wife 
told the story, she probably would not have chosen to begin it the same way as 
Neville, leaving out the reason for kicking him out. Perhaps the story is told 
from the point of view of a homeless organization in search of publicity. 

Whatever the author's "preferred" reading, readers bring in their own pref- 
erences, their own points of view. Texts are hardly ever so “closed” as to allow 
only one type of reading to the exclusion of all others. (On the concept of 
"open" and "closed" texts, see Eco 1979:8—11.) Even when they try to be, the 
outcome of their reading by a different “model reader” is unpredictable. “Nobody
can say what happens when the actual reader is different from the ‘average’
one” (Eco 1979:8). Contrary to the structuralist view of a text as a process
closed by the author in production, Culler, Eco, and others see the text as
something actually produced in the reading process. “The reader as an active 
principle of interpretation is a part of the picture of the generative process of 
the text” (Eco 1979:4).23 

To Neville's text we bring our attitudes toward homelessness and gender. 
According to Millar (1982), in the 1970s the English public perceived home- 
less people as drunks, but by the 1980s they viewed them as “wandering mad 
people" and they blamed their presence on the streets on the closure of mental 
hospitals and the failure of community care. The result of an unplanned experiment
has similarly revealed the role of our gender attitudes on the interpretation
tion of texts. In discussing Neville's text in a graduate class, the women in the 
class were much less likely than the men to automatically grant Neville their 
pity, to unconditionally take his side regardless of what he had done to deserve 
his present condition. They were much more likely to view the woman in the 
story not as a ruthless wife but as a responsible mother. Neville must have done
something heinous to drive such a woman to kick him out—a wimpy character 
who, while looking for our sympathy, does not even have the courage to tell us 
what he did. It is clear from the unfolding of the story that the wife's actions are 
motivated not by anger or revenge toward her husband, but only by her son's 
well-being. She is certainly not using Ricky as a weapon against Neville. No, 
Neville's story is just another story told by a man, in men's image, and for 
men's consumption. 


23Significantly, the first section of the “Introduction” to Eco's collection of essays in The Role
of the Reader bears the title “How to produce texts by reading them” (Eco 1979:3).

The point is: Understanding of even the simplest text requires a great deal of
background knowledge, an “intertextual encyclopedia” in Eco’s words (Eco
1979:7, 208). In fact, some would argue that our very ability to understand, to 
really understand Neville's story depends upon an empathetic understanding 
of the “other,” upon having shared the same experiences as the “other”—a dis- 
tinct kind of knowledge sociologists and philosophers of knowledge have re- 
ferred to as verstehen. 


CONCLUSIONS 


Historian Carlo Ginzburg closes with the following lines from a wonderful lit- 
tle book on the trial by the Inquisition of a poor sixteenth-century miller called 
Menocchio: “About Menocchio we know many things. About this Marcato, or 
Marco—and so many others like him who lived and died without leaving a 
trace—we know nothing” (Ginzburg 1982: 128). Perhaps I can close similarly: 
About Neville we know very little. All we know is what he has left behind in a
short narrative of a moment of his life. In fact, we cannot even be sure that 
Neville is a real or fictitious character. We do not know what the purpose of the 
story is.

Yet, the linguistic analysis of Neville’s narrative has allowed us to shed 
light on many real lives like his. Narrative analysis has not only revealed the 
close relationship between the words in a text and between a text and other 
texts (e.g. stories and advertisements). Narrative analysis has brought out rela- 
tionships between people—texts do not just index a relation between words 
and between texts, but between text and social reality. Sociology has crept in 
behind linguistics. Neville’s simple (and perhaps, fictitious) narrative has 
sparked our sociological imagination; it has allowed us to get a glimpse of the 
broad social relations (especially of gender and class) of British society at the 
turn of the second millennium. 

That, of course, may be too brazen a claim in the eyes of the sociological 
“scientist.” Ginzburg and his fellow historians may deal with the life of Menocchio.
Linguists may spend pages and whole tomes arguing over the structure
and meaning of a four-line text. Other members of the academic community
may make their living on the basis of a single story. But sociology is about
discovering general laws. Sociology is a science. It is not interested in the particular;
its objective is the universal.24 We can hardly find any interest in this
man’s life, a man we do not even know is real. Much of the sociological debate 
between qualitative and quantitative approaches has centered on the issue of 


24On these issues, see Windelband’s 1894 inaugural lecture (Windelband 1980) and
Rickert, Windelband's student (Rickert 1962:55–56).

sample size, of the small “N” of ethnographic approaches. The analysis of
Neville's story may just be an extreme example of the study of the particular.25

That view of the sociological enterprise has led sociologists to their own 
way of approaching narrative texts: content analysis (for all, see Krippendorff
1980). Notwithstanding the ethnomethodological approach to text, sociologists
have typically not been interested in (nor do they have the theoretical and
methodological tools for) analyzing the linguistic nuances of a text—what can 
one text tell them about broader social relations anyway? Nor have they been 
interested in the invariant, structural patterns of narrative—yes, it is patterns 
that sociologists are after, but not patterns of texts (that’s linguists’ business), 
rather patterns of social relations. (On this point, see also Todorov 1981:5—6). 
And in their search for patterns of social relations, they tease out of a text the 
common threads (“themes”), common to the texts but as they apply to real
human beings—and then they count and tabulate those themes (how many of 
these, how many of those). Alternatively, they provide snippets of those com- 
mon themes (think, indeed, of such beautiful accounts as Erikson’s and Lie- 
bow’s). In analyzing “respondents’ stories,” sociologists cut up individual sto- 
ries and recompose the pieces into new stories, with the coherence and context 
of each original narrative lost and forgotten. Upon the new stories, sociologists 
then impose the coherence of the "scientific" ethnographic text in the context 
of sociological “literature.”26 
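
The counting and tabulating step described above can be illustrated with a minimal sketch. It is not drawn from any of the studies cited; the respondent identifiers, theme labels, and function names are all hypothetical, and the coding of segments into themes is assumed to have been done by hand beforehand.

```python
from collections import Counter

# Hypothetical coded segments: (respondent id, theme label) pairs.
# Respondents and theme labels are invented purely for illustration.
coded_segments = [
    ("R1", "work"),
    ("R1", "family"),
    ("R2", "work"),
    ("R2", "work"),
    ("R3", "family"),
]


def tabulate_themes(segments):
    """Count how many coded segments fall under each theme."""
    return Counter(theme for _, theme in segments)


def respondents_per_theme(segments):
    """Count how many distinct respondents mention each theme."""
    distinct_pairs = {(rid, theme) for rid, theme in segments}
    return Counter(theme for _, theme in distinct_pairs)


theme_counts = tabulate_themes(coded_segments)
respondent_counts = respondents_per_theme(coded_segments)
```

Note that both tables discard the order and context in which each theme appeared within a respondent's story, which is precisely the loss of narrative coherence at issue here.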

Yet, “precisely because they are essential meaning-making structures, nar- 
ratives must be preserved, not fractured, by investigators, who must respect re- 
spondents’ ways of constructing meaning and analyze how it is accomplished” 
(Kohler Riessman 1993:4). That, of course, is easier said than done. Such close
analysis of a handful of short texts is possible. But where does that leave re- 
searchers confronted with large bodies of narrative data (e.g. ethnographic ma- 
terial or unstructured interviews)? At the current state of linguistic formaliza- 
tion and computer-software development, there may be no escape from one 
form or another of the kind of thematic analysis proposed by content analysis 
[even when these themes or “concepts” are represented in terms of network relations
(see Carley 1993)].

Ultimately, each technique has its advantages and limitations. Researchers 
should be aware of what is gained and lost with the use of each technique. To 
the extent possible, researchers approaching the study of textual material (in 


25 I should point out, however, that linguists have been no less interested than sociologists in
building general theories and that they have been very successful in building such theories on the
basis of case studies. The real issue, in fact, is not the number of cases scientists take into account in
building their models but whether individual cases fit into a general model.

26 For a brief sociological introduction to narrative, see Kohler Riessman (1993) and the
literature cited there; for a more ethnographic view, see Van Maanen (1988); for an epistemological
feminist manifesto on narratives, see Graham (1984).
fact, not just those) should use a variety of methods. Even simple frequency
counts of words have their value when properly used and backed up by other
forms of evidence and analysis. As usual, what really counts is not the methods 
used but the questions asked. 
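
As a minimal sketch of what a simple word-frequency count involves (the sample sentence, the keyword list, and the function name are illustrative assumptions, not material from the texts analyzed in this chapter):

```python
import re
from collections import Counter


def word_frequencies(text, keywords=None):
    """Return case-insensitive word counts, optionally restricted to keywords."""
    # Lowercase the text and keep runs of letters and apostrophes as words.
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    if keywords is None:
        return counts
    # Keywords that never occur are reported with a count of zero.
    return Counter({k: counts[k.lower()] for k in keywords})


# Invented sample text, used only to show the mechanics of the count.
sample = "Rigor and precision matter. Rigor without questions is empty."
freq = word_frequencies(sample, keywords=["rigor", "precision", "power"])
```

The count itself says nothing about what the words mean in context, which is why such tallies need the backing of other forms of evidence and analysis.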

Given the enormous difficulties and ambiguities that we have encountered 
in understanding and extracting meaning from even such a simple text as Nev- 
ille's story, some of the key words that appear in most definitions of content 
analysis (“objective,” “systematic,” “scientific,” “quantitative,” “replicable
and valid inference,” “explicitly formulated rules”) will strike any reasonable
reader as overly optimistic if not altogether misguided (see Shapiro & Markoff 
1997 for a quick summary of definitions). In light of Eco’s “network of different
messages,” “intertextual frames,” and “intertextual encyclopedias” that go
into the reading process (Eco 1979:5), how could we ever hope to squeeze it all 
in a handbook of coding rules that did not itself look like an encyclopedia? The 
emphasis on “objective,” “systematic,” or “scientific” in the process of going 
from text to coding may succeed in drawing attention away from the murky 
waters of text understanding, but ... it does not get us any closer to finding real 
solutions to the problem. The scientific claims of content analysis must not ref- 
er to this central aspect of the technique but, perhaps, to other, more peripheral 
aspects. That is indeed the case. Lasswell, one of the “founding fathers” of
content analysis, concentrated his “scientific” efforts on such issues as statisti- 
cal sampling of texts, design of coding categories, validity and reliability, unit 
of analysis, and methods of data analysis, but did not tackle the fundamental is- 
sue of meaning [see the many chapters in Language of Politics dedicated to 
methodological issues (Lasswell et al 1968)]. Taking the part for the whole—a 
rhetorical device or “figure of speech” technically known as synecdoche, one
of the four “master tropes” with metaphor, metonymy, and irony—scientists 
effectively generalize their scientific claims. 

The debate on the sex of the angels occupied some of the most brilliant 
minds of the Middle Ages. Within the Christian culture of medieval European 
societies, that issue addressed central concerns of dominant cultural frame- 
works. Has the current culture of scientific or pseudoscientific discourse 
blinded us to the point that we actually believe some of the things we are writing?
(As an exercise in the rhetoric of science, count the number of times such key 
words as “rigor,” “power,” “precision,” and the like appear in my published 
work on semantic grammars.) Perhaps, in light of the themes discussed in this
chapter, what we need is more “open texts,” scientific texts that are open to the
conditions of their own production. Hopefully, in this process of self- 
reflectivity, we will not have fallen prey to postmodernist gibberish, nor will 
we have given up an honest search for rigor in the social sciences. 

The narrative analysis of Neville's story also points to a different way of 
looking at the relationship between the micro and the macro, the particular and 


55 35 66 


550 FRANZOSI 


the universal. The process of contextualizing a text for narrative understand- 
ing—the foreknowledge of knowledge—quickly leads us away from Neville's 
microcosm to the macrocosm of British society. That same process quickly 
leads us away from narrow linguistic concerns. I have provided a handful of 
examples on how to link a linguistic analysis to a sociological analysis, how to 
go from text to context, from Neville's particular to the universal. 

For sure I have picked and chosen here and there to shed light on context, 
selectively drawing from the “intertextual encyclopedia” of knowledge. The
linkages I have explored between text and context, micro and macro, are more 
tentative and informative than definitive. Readers who approach narrative 
texts with more strongly substantive-driven problems will no doubt pursue 
those linkages more systematically. My goal was modest: Show the reader 
how to raise sociologically informed research questions and how to pursue 
them, rather than answer them, starting from a narrative text. Following that 
goal, I hope to have also shown that narrative analysis (broadly conceived here 
as the analysis of both linguistic and extralinguistic characteristics of speech 
acts) yields an understanding of social relations as embedded in linguistic 
practices. Many of the studies reviewed here have shed light on how specific 
linguistic mechanisms underline social relations of gender or class. Unlocking 
those mechanisms in one particular social setting may ultimately provide 
greater knowledge than that based on tenuous statistical relations between 
poorly measured and even more poorly understood concepts in the context of 
poorly estimated models, even if the rules on how to go from the particular to 
the universal are scientifically embedded in the procedures we use. (Think, for 
instance, of sampling statistics.) In that sense, narrative analysis may have also 
shown how social scientific practices involve specific language games in rela- 
tion to the people we draw information from (our subjects/objects of study) 
and pass information to (our readers). 

To the novice, all of this will surely sound like a daunting task (perhaps it is 
easier to let the computer run regressions). The understanding of the text has 
required us to zoom down on linguistic problems. The understanding of the 
context has required us to open up to neighboring and distant disciplines, to 
harness knowledge that comes from far afield. Don't despair! The good news 
is that literary competence is not intuitive but learned (Culler 1975:113-30;
Toolan 1988:29; Cohan & Shires 1988:22; see also Bourdieu 1984:399). And 
so is the “competence” of linking a narrative analysis to a sociological analy- 
sis. Just start from Statistics 101 ... sorry ... Narrative 101. 